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[ij [0003] Orchestration of the cell division cycle includes a series of checkpoints which 

"2 ensure that some events are completed before others begin (Murray, A. et al. The Cell Cycle 
20 Oxford University Press (1993). One set of controls determines whether cells replicate their 
genome in preparation for division (Gl/S), while another checks that DNA replication is 
complete and that the cell has grown sufficiently for division to take place (G2/M) (Nasmyth, K. 
Science 274:1643-1645 (1996). Cyclins and cyclin-dependent kinases (CDKs) regulate these 
events in part by controlling the transcription of specific effector genes (Okayama, H. et al., Adv. 
25 Cancer Res. 69:17-62 (1996); Sanchez, I. et al., Curr. Opin. Cell. Biol. 8:318-824 (1996)). In 
budding yeast, CDC28 regulates the transcription of genes whose products are needed for the 
Gl/S transition or S phase (Andrews, B.J. et al., J. Biol. Chem. 265:14057-14060 (1990); 
Johnston, L.H. et al., Nucl. Acids Res. 20:2403-2010 (1992)) via the transcription factors SBF 
(Swi4-Swi6) and DSC1 (Swi6-Mbpl) (Andrews, B.J. et al., Cell 57:21-29 (1989); Dirick, L. et 
30 al., Nature 357:508-513 (1992); Lowndes, N.F. et al., Nature 357:505-508 (1992) ; Lowndes, 
N.F. et al., Nature 350:247-250 (1991); Taba, M.R. et al., Genes Dev. 5:2000-2013 (1991)). In 
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fission yeast, the SBF-related heterodimer, MBF, is required for the expression of similar genes 
(Aves, S.J. et al., EMBO J. 4:457-463 (1985); Lowndes, N.F. et al, Nature 357:505-508 (1992); 
Tanaka, K. et al., EMBO J. 11:4923-4932 (1992)). In mammalian cells, the CDK-regulated 
transcription factor E2F plays a key role in regulating the Gl/S transition (Muller, R., Trends 
5 Genet. 11:173-178 (1995); Sanchez, I et al., Curr. Opin, Cell. Biol. 8:318-824 (1996)). E2F is 
complexed with Rb until Rb phosphorylation by Gl cyclin-dependent kinases releases E2F to 
activate transcription of immediate early genes including myc, fos, and jun (Beijersbergen, R.L. 
et al., Biochim. Biophys. Acta 1287:103-120 (1996)). 

[0004] Similar cyclin-dependent control mechanisms regulate the G2/M transition 
10 (Forsberg, S.L. et al., Annu. Rev. Cell. Biol. 7:227-256 (1991); Nurse, P., Cell 79:547-550 
O (1994); Nurse, P., Nature 344:503-508 (1990)), but less is known about their downstream targets 
si (Stukenberg, P.T. et al., Curr. Biol. 7:338-348 (1997)). In fission yeast, regulation of the Cdc2- 
u : Cdcl3 cyclin-dependent kinase-cyclin complex by the Weel kinase and Cdc25 phosphatase is 
O thought to be the primary mechanism controlling G2/M (Okayama, H. et al., Adv. Cancer Res. 
ft 69:17-62 (1996); Russell, P. et al., Cell 49:559-567 (1987)). The Cdc2-Cdcl3 complex 
;i_ accumulates during S phase, but Cdc2 is phosphorylated and thereby maintained in an inactive 
P state by Weel (Fleig, U.N. et al., Semin. Cell. Biol. 2:195-205 (1991); Lundgren, K. et al., Cell 
J;2 65:1111-1122 (1991)). As cells complete DNA replication, Weel is phosphorylated by Niml 
Q and thereby inactivated (Russell, P. et al., Cell 49:559-567 (1987)), and Cdc25 accumulation 
'20 leads to the dephosphorylation of Cdc2 (Gautier, J. et al., Cell 67:197-21 1 (1991); Moreno, S. et 
al., Nature 344:549-552 (1990)). Dephosphorylation and activation of Cdc2 heralds progression 
through G2 and entry into mitosis. The biochemical events controlling G2/M transit in 
mammalian cells are remarkably similar to those in S. pombe. Mammalian Cdc2 kinase 
accumulates in S phase (Shimizu, M. et al., Mol. Cell. Biol. 15:2882-2892 (1995)) and is 
25 regulated by a Weel kinase (Parker, L.L. et al., Science 257:1955-1957 (1992)) and Cdc25 
phosphatase (Honda, R. et al., FEBS Lett. 318:331-334 (1993)). While G2/M progression 
requires the coordinated expression of many genes, how this is regulated at the level of 
transcription remains largely unknown. The identification and characterization of transcription 
factors regulating G2 progression and mitotic entry, therefore, would significantly advance our 
30 understanding of the mechanisms controlling this portion of the cell cycle. 
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[0005] Most mammalian cells, such as hepatocytes, reside in GO and can re-enter the 
cell cycle and undergo mitosis. Significant exceptions to this general rule include skeletal and 
cardiac myocytes, which are terminally differentiated and apparently incapable of undergoing 
mitosis shortly after the postnatal period. Tam et al. disclosed the possibility that reversal of 
5 terminal differentiation in cardiac myocytes might be achieved by manipulation of pocket 
proteins and/or cyclin D and cdk2 expression and function (Annals NY Acad Sci. 752: 72-79 
(1995). Kirshenbaum et al. (J. Biol. Chem. 270:7791-7794 (1995)) disclosed the reactivation of 
DNA synthesis, but not proliferation, of cardiac myocytes by the adenoviral protein El A in 
concert with E1B delivered via an adenovirus vector. Kirshenbaum et al. (Dev. Biol. 179:402- 
10 41 1 (1996)) disclosed that E2F-1 delivered via an adenovirus vector together with E1B can also 
Q activate DNA synthesis and cause the accumulation of cardiac myocytes in G2/M. 
C\ [0006] S. pombe cdc5p was first described as a putative DNA binding protein 

(A implicated in G2/M transit (Nasmyth, K. et al. (1981) Mol Gen Genet 182, 119-24). We 
O subsequently identified a cDNA encoding a protein with limited homology to S. pombe cdc5p 
'|J5 (Bernstein, H. S. et al. (1997) J. Biol. Chem. 272, 5833-7). Its widespread expression in human 
tissues and homology with expressed sequences in other eukaryotes suggested an evolutionarily 
!=& conserved general function (Bernstein, H. S. et al. (1997) J. Biol. Chem. 272, 5833-7). Nuclear 
import upon serum stimulation of mammalian cells suggested a possible role in cell proliferation 
Q (Bernstein, H. S. et al. (1997) J. Biol. Chem. 272, 5833-7). 

' 20 [0007] Effector genes regulated by other members of the Cdc5 family in S. cerevisiae 

(Ohi, R. et al. (1998) Mol Cell Biol 18, 4097-108), A. thaliana (Hirayama, T. et al (1996) Proc 
Natl Acad Sci U S A 93, 13371-6), C. elegans (Bernstein, H. S. et al (1997) J Biol Chem. 272, 
5833-7), D. melanogaster (Katzen, A. L. et al. (1998) Genes Dev 12, 831-43; Ohi, R. et al (1998) 
Mol Cell Biol 18, 4097-108), and M. musculus (Bernstein, H. S. et al. (1997) J Biol Chem 272, 

25 5833-7) have not been identified. Cdc5-related proteins contain tandem helix-turn-helix DNA 
binding motifs at their amino termini, similar to that seen in Myb-related proteins (Bernstein, H. 
S. et al. (1997) J Biol Chem 272, 5833-7). In contrast with c-Myb, however, Cdc5-related 
proteins contain only two repeats of the helix-turn-helix motif, whereas Myb family members 
possess three (Bernstein, H. S. et al. (1997) J Biol Chem 272, 5833-7; Hirayama, T. et al. (1996) 

30 Proc Natl Acad Sci USA 93, 13371-6; Katzen, A. L. et al. (1998) Genes Dev 12, 831-43; Ohi, R. 
et al. (1998) Mol Cell Biol 18, 4097-108; Ohi, R. et al. (1994) EMBO J 13, 471-83). Moreover, 
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within this domain Cdc5-related proteins bear a Val->Leu substitution at a position critical for 
DNA binding specificity (Carr, M. D. et al. (1996) Eur J Biochem 235, 721-35; Ogata, K. et al. 
(1996) Nat Struct Biol 3, 178-87). Cdc5 family members, therefore, likely differ from Myb in 
their DNA binding properties. 
5 [0008] Recently a 7bp nucleotide sequence identified by random oligonucleotide 

binding site selection was shown to interact with the DNA binding domain of A. thaliana in 
vitro, however, binding with this sequence was reduced with non-specific competitor DNA 
(Hirayama, T. et al. (1996) Proc Natl Acad Sci USA 93, 13371-6). Similar experiments to 
identify a consensus binding site for the highly conserved DNA binding domains of D. 
10 melanogaster Cdc5 and the Cdc5 homologue in S. cerevisiae, Ceflp, failed to identify any 
s-i preferential site, nor did they interact with the 7 bp sequence identified for A. thaliana (Ohi, R. et 
'0 al. (1998) Mol Cell Biol 18, 4097-108). In addition, others have shown that Cdc5 does not 
11! activate the transcription of candidate genes known to be upregulated during G2/M, for example 
1% cdc2 and String in D. melanogaster (Katzen, A. L. et al. (1998) Genes Dev 12, 831-43), Clbl 
% and Swi5 in S. cerevisiae (Ohi, R. et al. (1998) Mol Cell Biol 18, 4097-108), and cdcJ3 + and 
cdc25 + in S. pombe (Ohi, R. et al. (1998) Mol Cell Biol 18, 4097-108). These results raised the 
J := f question of whether Cdc5 family members serve functions other than as site-specific DNA 
O binding proteins (Ohi, R. et al. (1998) Mol Cell Biol 18, 4097-108), even though they possess a 
pi Myb-like DNA binding domain. In addition, recent studies in fission and budding yeast have 
"2t) implicated a role for cdc5p and Ceflp, respectively, in pre-mRNA splicing (McDonald, W. H. et 
al. (1999) Mol Cell Biol 19, 5352-62; Tsai, W. Y. et al. (1999) J Biol Chem 274, 9455-62), and 
hCdc5 has been identified as a component of the mammalian splicesome (Burns, C. G. et al. 
(1999) Proc Natl Acad Sci USA 96, 13789-13794; Neubauer, G. et al. (1998) Nature Genet 20, 
46-50). 

25 

SUMMARY OF THE INVENTION 
[0009] One aspect of the invention is an isolated nucleic acid having the sequence of 
Figure 2D (SEQ ID NO: 1 1). 

[0010] A further aspect of the invention is an antisense nucleic acid comprising a 
30 nucleic acid sequence complementary to the nucleic acid sequence of Figure 2D (SEQ ID NO: 
11). 
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[0011] A further aspect of the invention is a method for treating a cell cycle defect in 
patient comprising administering to cells in the patient a therapeutic amount of an hCdc5 protein. 

[0012] A further aspect of the invention is a method for treating a cell cycle defect in a 
patient comprising administering to cells in the patient an antagonist of hCdc5. 
5 [0013] A further aspect of the invention is a method of treating a patient having a 

hyperproliferative disease, comprising administering to hyperproliferative cells in the patient 
nucleic acid comprising an hCdc5 nucleic acid, wherein the polypeptide encoded by the nucleic 
acid is over-expressed, and as a result of the over-expressed polypeptide the cells die. 

[0014] A further aspect of the invention is a method of regulating the progression of a 
10 cell cycle through G2 and into mitosis, comprising administering to a cell an antagonist of 
O hCdc5. 

*Q [0015] A further aspect of the invention is a polypeptide having the amino acid 

Iji sequence of Figure 1 A (SEQ ID NO: 1). 

& [0016] A further aspect of the invention is a hCdc5 binding site nucleic acid and vectors 

! f 5 comprising a hCdc5 binding site nucleic acid. In particular embodiments, the vector comprises 
the hCdc5 binding site nucleic acid operably linked to nucleic acid encoding a protein of interest, 
rf Also included is a method of using the vector for expressing the protein of interest in a cell in 
O which hCdc5 is expressed and a method of detecting the expression of hCdc5 in a cell. 
p% [0017] Another aspect of the invention is a method for identifying a compound that acts 

l= 20 as an effector of the interaction between hCdc5 and the hCdc5 DNA binding site. The effector 
compound can be an inhibitor or an enhancer of the interaction. Inhibitors identified by the 
method are useful for treating a patient having a hyperproliferative disease. Enhancers identified 
by the method are useful for promoting cell division in normally non-regenerable cells. The 
combination of hCdc5 protein and its DNA binding site nucleic acid is also useful for controlling 
25 the regulation of a recombinant gene. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0018] Figure 1A depicts the amino acid sequence (SEQ ID NO:l) of the 802-amino 
acid open reading frame of hCdc5 derived from HeLa cDNA clone 67.1. The thick solid 
30 underline denotes putative DNA binding domains; the thinner solid underline denotes putative 
nuclear localization signals; the broken underline denotes the putative activating domain. Shown 
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are potential phosphorylation sites for casein kinase II ("•"), MAP kinase ( • ), protein kinase A 
(■), and protein kinase C (♦). 

[0019] Figure IB is a diagram depicting the proposed functional domains of hCdc5. R2 
and R3 refer to similar repeats of the helix-turn-helix DNA binding motif seen in a-, b-, and c- 
5 Myb. The amino acid number is indicated above the map. NLSs denotes nuclear localization 
signals. 

[0020] Figure 2A depicts dot matrix plots demonstrating homology with S. pombe Cdc5 
as well as expressed sequence tags in M. musculus, C. elegans, and S. cerevisiae. Contiguous 
amino acids, as indicated by coordinates, were aligned using a Pustell homology matrix with 
10 window size = 8, minimum percent score = 60, and hash value = 2. 

O [0021] Figure 2B depicts a sequence alignment demonstrating amino-terminal 

5 homology of hCdc5 (SEQ ID NO: 2) with S. pombe Cdc5 (SEQ ID NO:3) and the Myb family 
Y\ DNA binding domain. Whereas a-, b-, and c-Myb (SEQ ID NO: 5, 4 and 6 respectively) contain 

''s-'i 

O three tandem repeats of the helix-turn-helix motif (Rl, R2, R3), hCdc5 and S. pombe Cdc5 
'3 5 contain two tandem repeats (R2, R3). R2 is shown. Identical amino acids are indicated by 

5i boxes. Gaps (-) were introduced to maximize alignment. Conserved tryptophan residues 

G 

|=i characteristic of the helix-turn-helix motif (Nelson, H.C., Curr. Opin. Genet. Der. 5:180-189 
(1995) are indicated in bold. The Val->Leu substitution in the R2 DNA binding cavity, 
O important for DNA binding specificity in Myb-family members (Ogata, K. et al., Nature 
*~20 Structural Biology 3:178-187 (1996)) is indicated by • . 

[0022] Figure 2C depicts a sequence alignment of the central, hydrophilic region of 
hCdc5 (SEQ ID NO:7), a putative activating domain, and the activating domains of a- and b- 
Myb (SEQ ID NO: 10 and 9, respectively). Prolines are highlighted in bold. Gaps (-) were 
introduced to maximize alignment. 
25 [0023] Figure 2D depicts the nucleotide sequence of hCdc5 cDNA (SEQ ID NO: 1 1). 

[0024] Figure 3 is a Northern blot of poly(A) mRNA probed with a 2.85-kb cDNA for 
hCdc5 (top) and with cDNA for P-actin as a control for the amount of mRNA loaded in each 
lane (bottom). 

[0025] Figure 4A is a Western blot using M2 monoclonal antibody to detect epitope- 
30 tagged hCdc5 in transfected COS-7 cells (COS-7, untransfected cells; pcDNA3, expression 
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vector only; pcD67F, vector directing HCdc5 expression) The 105-kDa transfection-dependent 
band is designated hCdc5. 

[0026] Figure 4B depicts an autoradiograph of hCdc5 immunoprecipitates resolved by 
sodium dodecyl sulfate-polyacrylamide gel electrophoresis (top). The transfection-dependent 
band at 105 kDa is designated hCdc5. In the lower panel the immunoprecipitates were resolved 
by gel electrophoresis and analyzed using M2 monoclonal antibody (bottom). 

[0027] Figure 5 is an immunoblot of dividing 3T1 5.8.22 fibroblasts, stably transformed 
to express hCdc5 under control of the bacterial tetracycline operator, grown in the presence or 
absence of tetracycline for the times indicated. Whole cell lysates were analyzed by 
immunoblotting with monoclonal antibodies to either the endogenous protein (Bl) or FLAG- 
epitope present on recombinant hCdc5 (M2). While the recombinant protein is absent from the 
3T15.8 parental cell line, it is expressed at levels about 2-fold above the endogenous protein by 
36 hours after tetracycline withdrawal, as assessed by densitometry. A non-specific protein (*) 
recognized by the secondary goat anti-mouse antibody is included as a control to demonstrate the 
amount of protein loaded in each lane. 

[0028] Figure 6 comprises panels A-H and depicts the results of flow cytometry of 
3T15.8.22 cultures synchronized in GO by growth for 48 hours in media made 0.5% serum with 
or without tetracycline, released from quiescence by addition of serum to 10%, and analyzed at 
indicated timepoints. Cells were stained with propidium iodide and Hoechst 33342 to determine 
viability and DNA content, respectively. The experiment was repeated 5 times, with a 
representative study shown. 

[0029] Figure 7 is a graph depicting actively dividing 3T1 5.8.22 and parental 3T15.8 
parental cultures plated at about 1250 cells/cm2, grown in the presence or absence of 
tetracycline, and counted at indicated timepoints. Data represent means from 4 experiments +/- 
sem. Calculated cell cycle lengths are 26 hours for 3T15.8 cultures either in the presence or 
absence of tetracycline, 28 hours and 19 hours for 3T1 5.8.22 cultures in the presence or absence 
of tetracycline, respectively. 

[0030] Figure 8 is a graph depicting alteration of cell cycle distribution in asynchronous 
cultures resulting from overexpression of hCdc5. Actively dividing 3T1 5.8.22 and 3T15.8 
cultures were grown in the presence or absence of tetracycline for 72 hours, stained with 
propidium iodide and Hoechst 33342 to determine viability and DNA content, respectively, and 
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analyzed for cell cycle distribution by flow cytometry. Phase lengths were calculated from the 
percentage of cells in each phase determined by flow cytometry and the cell cycle length 
calculated from the growth curves in Figure 7. Data represent means from 4 experiments +/- 
sem. 

5 [0031] Figure 9 is a graph depicting the overcoming of a G2/M size checkpoint by 

overexpression of hCdc5. 3T1 5.8.22 and 3T15.8 cultures were synchronized in GO, released, 
and analyzed by flow cytometry as described in Figure 6. Forward light scatter (Fsc) is plotted 
as a function of time. Data represent means from 3 experiments +/- sem. Average cell size in 
serum-deprived cultures overexpressing hCdc5 gradually decreases as cells reach GO. Upon 
10 release into Gl, 3T1 5.8.22 cells grown in the absence of tetracycline remain smaller, with no 
O significant increase in size over the course of the cell cycle compared with 3T1 5.8.22 cells 
Q grown in the presence of tetracycline (p=0.05). 

*f\ [0032] Figure 10 is a schematic representation of hCdc5 protein indicating the wild- 

O type (hCdc5) and deletion mutant (A675) tested. 

MS [0033] Figure 11 is a graph depicting the alteration of cell cycle distribution in 

^ asynchronous cultures resulting from expression of mutant A675. Actively dividing Jurkat 

cultures were co-transfected with a vector directing the expression of green fluorescent protein 
m (GFP) and wild-type or mutant hCdc5, or empty vector (pcDNA3), and grown for 72 hours. 10 6 

cells were stained with propidium iodide and Hoechst 33342 to determine viability and DNA 
20 content, respectively, and 10 5 cells were sorted by flow cytometry for GFP expression. 2 x 10 4 

GFP-positive cells were analyzed for cell cycle distribution. A representative experiment is 

shown. 

[0034] Figure 12 is a graph depicting the delay of G2/M progress by mutant A675. 
Jurkat cells were transfected and analyzed as in Figure 1 1 . Cell cycle distributions for GFP- 
25 expressing transfectants were included in this analysis. Relative amounts of each vector used in 
transfection are shown. Data represent means from 3 experiments +/- sem. An increase or 
decrease in distribution toward G2/M is seen with mutant or wild-type hCdc5, respectively, 
compared with empty vector or untransfected cells. 

[0035] Figure 13 is a graph depicting the specific delay of mitotic entry by mutant 
30 hCdc5. Jurkat cells were transfected as in Figure 11. At 48 hours after transfection, nocodazole 
was added to 0.5 u.g/ml, and cells were stained with Hoechst 33342 at timepoints indicated. For 
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each experiment, slides containing 5 x 10 3 cells were prepared in triplicate from each 
transfection and 200 GFP-positive cells per slide were assessed for the presence of chromosomal 
condensation. Prophase index is plotted as a function of time. Data represent means from 
duplicate experiments +/- sem. 
5 [0036] Figure 14A is an immunoblot depicting temporal expression of endogenous 

hCdc5. BALBc/3T3 cultures were quiesced in low serum for 48 hours, then released with serum 
stimulation and harvested at indicated timepoints. Whole cell lysates were analyzed by 
immunoblotting with monoclonal antibody to the C-terminus of hCdc5, or polyclonal antibodies 
to murine cyclins E (p50 CLNE), A (p60 CLNA), or Bl(p62 CLNB1). 
10 [0037] Figure 14B is a graph depicting cells at each timepoint in Figure 14A. Cells 

O were fixed in ethanol, stained with propidium iodide, and evaluated by flow cytometry for DNA 

C% content distribution to confirm synchronization. 

W [0038] Figure 1 5 shows the DNA sequences determined for the binding sites of the 

O synthetic oligonucleotides selected by repetitive screening/selection with recombinant His- 
t|5 tagged hCdc5. Those labeled 3.01, 3.01, 3.03, 3.04 and 3.05 were selected in the third round; 
'■; those labeled 6.01, 6.02, 6.03, 6.04, and 6.05 were selected in the sixth round and those labeled 
\2 8.01, 8.02, 8.03, 8.04, and 8.05 were selected in the eighth round. 

O DETAILED DESCRIPTION OF THE INVENTION 

20 Definitions 

[0039] Abbreviations for the twenty naturally occurring amino acids follow 
conventional usage. In the polypeptide notation used herein, the left-hand direction is the amino 
terminal direction and the right-hand direction is the carboxyl-terminal direction, in accordance 
with standard usage and convention. Similarly, unless specified otherwise, the left hand end of 

25 single-stranded polynucleotide sequences is the 5' end; the left hand direction of double-stranded 
polynucleotide sequences is referred to as the 5' direction. The direction of 5' to 3' addition of 
nascent RNA transcripts is referred to as the transcription direction; sequence regions on the 
DNA strand having the same sequence as the RNA and which are 5' to the 5' end of the RNA 
transcript are referred to as "upstream sequences"; sequence regions on the DNA strand having 

30 the same sequence as the RNA and which are 3' to the 3' end of the RNA transcript are referred 
to as "downstream sequences". 
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[0040] The term "nucleic acids", as used herein, refers to either DNA or RNA. 
"Nucleic acid sequence" or "polynucleotide sequence" refers to a single- or double-stranded 
polymer of deoxyribonucleotide or ribonucleotide bases read from the 5' to the 3' end. It 
includes both self-replicating plasmids, infectious polymers of DNA or RNA and nonfunctional 
5 DNA or RNA. 

[0041] "Nucleic acid probes" may be DNA or RNA fragments. DNA fragments can be 
prepared, for example, by digesting plasmid DNA, or by use of PCR, or synthesized by either the 
phosphoramidite method described by Beaucage and Carruthers, Tetrahedron Lett. 22:1859-1862 
(1981), or by the triester method according to Matteucci, et al., J. Am. Chem. Soc. 103:3185 
10 (1981), both incorporated herein by reference. A double stranded fragment may then be 
O obtained, if desired, by annealing the chemically synthesized single strands together under 
Ci appropriate conditions or by synthesizing the complementary strand using DNA polymerase with 
y \ an appropriate primer sequence. Where a specific sequence for a nucleic acid probe is given, it 
O is understood that the complementary strand is also identified and included. The complementary 
135 strand will work equally well in situations where the target is a double-stranded nucleic acid. 
ai _ [0042] The phrase "selectively hybridizing to" refers to a nucleic acid probe that 

y± hybridizes, duplexes or binds only to a particular target DNA or RNA sequence when the target 
jj; sequences are present in a preparation of total cellular DNA or RNA. "Complementary" or 
O "target" nucleic acid sequences refer to those nucleic acid sequences which selectively hybridize 
20 to a nucleic acid probe. Proper annealing conditions depend, for example, upon a probe's length, 
base composition, and the number of mismatches and their position on the probe, and must often 
be determined empirically. 

[0043] For discussions of nucleic acid probe design and annealing conditions, see, for 
example, Sambrook et al., Molecular Cloning: A Laboratory Manual (2nd ed.), Vols. 1-3, Cold 
25 Spring Harbor Laboratory, (1989) or Current Protocols in Molecular Biology, F. Ausubel et al., 
ed. Greene Publishing and Wiley-Interscience, New York (1987). 

[0044] The phrase "nucleic acid sequence encoding" refers to a nucleic acid which 
directs the expression of a specific protein or peptide. The nucleic acid sequences include both 
the DNA strand sequence that is transcribed into RNA and the RNA sequence that is translated 
30 into protein. The nucleic acid sequences include both the full length nucleic acid sequences as 
well as non-full length sequences derived from the full length protein. It being further 
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understood that the sequence includes the degenerate codons of the native sequence or sequences 
which may be introduced to provide codon preference in a specific host cell. 

[0045] The phrase "isolated" or "substantially pure" refers to nucleic acid preparations 
that lack at least one protein or nucleic acid normally associated with the nucleic acid in a host 
5 cell. 

[0046] The term "operably linked" as used herein refers to linkage of a regulatory 
control element (e.g., promoter, enhancer, transcription activator, and the like) upstream from a 
DNA sequence such that the contril element mediates expression of the DNA sequence. 

[0047] The term "vector", refers to viral expression systems, autonomous self- 
10 replicating circular DNA (plasmids), and includes both expression and nonexpression plasmids. 

[0048] Where a recombinant microorganism or cell culture is described as hosting an 
*H "expression vector," this includes both extrachromosomal circular DNA and DNA that has been 
U1 incorporated into the host chromosome(s). Where a vector is being maintained by a host cell, the 
s4 vector may either be stably replicated by the cells during mitosis as an autonomous structure, or 

SIS 

5 t5 is incorporated within the host's genome. 

Si [0049] The term "gene" as used herein is intended to refer to a nucleic acid sequence 

ri which encodes a polypeptide. This definition includes various sequence polymorphisms, 
O mutations, and/or sequence variants wherein such alterations do not affect the function of the 
O gene product. The term "gene" is intended to include not only coding sequences but also 
20 regulatory regions such as promoters, enhancers, and termination regions. The term further 
includes all introns and other DNA sequences spliced from the mRNA transcript, along with 
variants resulting from alternative splice sites. 

[0050] The phrase "recombinant protein" or "recombinanfiy produced protein" refers to 
a peptide or protein produced using non-native cells that do not have an endogenous copy of 
25 DNA able to express the protein. The cells produce the protein because they have been 
genetically altered by the introduction of the appropriate nucleic acid sequence. The 
recombinant protein will not be found in association with proteins and other subcellular 
components normally associated with the cells producing the protein. The terms "protein" and 
"polypeptide" are used interchangeably herein. 
30 [0051] A "reporter" is a molecule whose presence or activity in a sample (e.g., in a cell) 

can be easily and rapidly detected. A "reporter protein" is a protein whose presence or activity in 
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a sample (e.g., in a cell) can be easily and rapidly detected. Reporter proteins are well known in 
the art and may include, for example, luciferase, (5-galactosidase, P-glucuronidase, 
chloramphenicol transacetylase, green fluorescent protein. 

[0052] By "immunogenic fragment" of a polypeptide is intended a fragment of the 
polypeptide that is capable of eliciting an immune response in a mammal similar to that 
produced by the intact polypeptide. Antibodies produced in response to exposure to an 
immunogenic fragment will typically, although not always, recognize the intact polypeptide as 
well. In general, an immunogenic fragment will be at least 6 consecutive amino acids of the 
polypeptide; preferably, an immunogenic fragment will have at least 10 consecutive amino acids 
of the polypeptide. 

[0053] The following terms are used to describe the sequence relationships between 
two or more nucleic acids or polynucleotides: "reference sequence", "comparison window", 
"sequence identity", "percentage of sequence identity", and "substantial identity". A "reference 
sequence" is a defined sequence used as a basis for a sequence comparison; a reference sequence 
may be a subset of a larger sequence, for example, as a segment of a full-length cDNA or gene 
sequence given in a sequence listing, or may comprise a complete cDNA or gene sequence. 

[0054] Optimal alignment of sequences for aligning a comparison window may, for 
example, be conducted by the local homology algorithm of Smith and Waterman Adv. Appl. 
Math. 2:482 1981), by the homology alignment algorithm of Needleman and Wunsch J. Mol. 
Biol. 48:443 (1970), by the search for similarity method of Pearson and Lipman Proc. Natl. 
Acad. Sci. U.S.A. 85:2444 (1988), or by computerized implementations of these algorithms 
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package Release 
7.0, Genetics Computer Group, 575 Science Dr., Madison, WI). 

[0055] The terms "substantial identity" or "substantial sequence identity" as applied to 
nucleic acid sequences and as used herein and denote a characteristic of a polynucleotide 
sequence, wherein the polynucleotide comprises a sequence that has at least 85 percent sequence 
identity as compared to a reference sequence over a comparison window of at least 20 nucleotide 
positions, frequently over a window of at least 25-50 nucleotides, wherein the percentage of 
sequence identity is calculated by comparing the reference sequence to the polynucleotide 
sequence which may include deletions or additions which total 20 percent or less of the reference 
sequence over the window of comparison. In preferably embodiments the polynucleotide 
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comprises a sequence that has at least 90 to 95 percent sequence identity, and more preferably at 
least 99 percent sequence identity as compared to a reference sequence. The reference sequence 
may be a subset of a larger sequence. 

[0056] As applied to polypeptides, the terms "substantial identity" or "substantial 
sequence identity" mean that two peptide sequences, when optimally aligned, such as by the 
programs GAP or BESTFIT using default gap weights, share at least 80 percent sequence 
identity. In preferred embodiments, the two peptide sequences will share at least 90 percent 
sequence identity, more preferably at least 95 percent sequence identity or more. "Percentage 
amino acid identity" or "percentage amino acid sequence identity" refers to a comparison of the 
amino acids of two polypeptides which, when optimally aligned, have approximately the 
designated percentage of the same amino acids. For example, "95% amino acid identity" refers 
to a comparison of the amino acids of two polypeptides which when optimally aligned have 95% 
amino acid identity. Preferably, residue positions which are not identical differ by conservative 
amino acid substitutions. For example, the substitution of amino acids having similar chemical 
properties such as charge or polarity are not likely to affect the properties of a protein. Examples 
include glutamine for asparagine or glutamic acid for aspartic acid. 

[0057] The phrase "substantially purified" or "isolated" when referring to a peptide or 
protein, means a chemical composition which is essentially free of other cellular components. It 
is preferably in a homogeneous state although it can be in either a dry or aqueous solution. 
Purity and homogeneity are typically determined using analytical chemistry techniques such as 
polyacrylamide gel electrophoresis or high performance liquid chromatography. A protein 
which is the predominant species present in a preparation is substantially purified. Generally, a 
substantially purified or isolated protein will comprise more than 80% of all macromolecular 
species present in the preparation. Preferably, the protein is purified to represent greater than 
90% of all macromolecular species present. More preferably the protein is purified to greater 
than 95%, and most preferably the protein is purified to essential homogeneity, wherein other 
macromolecular species are not detected by conventional techniques. 

[0058] The phrase "specifically binds to an antibody" or "specifically immunoreactive 
with", when referring to a protein or peptide, refers to a binding reaction which is determinative 
of the presence of the protein in the presence of a heterogeneous population of proteins and other 
biologies. Thus, under designated immunoassay conditions, the specified antibodies bind to a 
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particular protein and do not bind in a significant amount to other proteins present in the sample. 
Specific binding to an antibody under such conditions may require an antibody that is selected 
for its specificity for a particular protein. A variety of immunoassay formats may be used to 
select antibodies specifically immunoreactive with a particular protein. For example, solid-phase 
5 ELISA immunoassays are routinely used to select monoclonal antibodies specifically 
immunoreactive with a protein. See Harlow and Lane (1988) Antibodies. A Laboratory Manual, 
Cold Spring Harbor Publications, New York, for a description of immunoassay formats and 
conditions that can be used to determine specific immunoreactivity. 

10 Nucleic Acids of the Invention 
O hCdc5 Nucleic Acids 

[0059] The instant invention provides hCdc5 ( also designated PCDC5RP) nucleic 
acids. The hCdc5 nucleic acids include hCdc5 gene or cDNA sequences, or sequences 
Q complementary thereto, or fragments thereof. The nucleic acid sequences of the invention are 
b typically identical to or show substantial sequence identity (determined as described above) to 
« the nucleic acid sequence of Figure 2D (SEQ ID NO: 1 1). Included in this definition are nucleic 
U acids which hybridize to the nucleic acid sequence of Figure 2D under stringent conditions. 
9 "Stringent" as used herein refers to hybridization and wash conditions of 50% formamide at 
O 42°C. Other hybridization conditions may also be selected that will provide the equivalent 
'20 degree of stricngency. Generally, stringent conditions are selected to be about 5° C lower than 
the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The 
Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence 
hybridizes to a perfectly matched probe. Typically, stringent conditions will be those in which 
the salt concentration is at least about 0.02 molar at pH 7 and the temperature is at least about 
25 60°C. As other factors may significantly affect the stringency of hybridization, including, 
among others, base composition and size of the complementary strands, the presence of organic 
solvents and the extent of base mismatching, the combination of parameters is more important 
than the absolute measure of any one. 

[0060] Oligonucleotides comprising at least 8-100 consecutive nucleotides unique to 
30 the sequence of Figure 2D (SEQ ID NO: 11) or its complement, more preferably 15-50 
consecutive nucleotides unique to the sequence of Figure 2D (SEQ ID NO:l 1) or its complement 
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are also provided in the instant invention. Such oligonucleotides may be provided as member of 
primer pairs. Oligonucleotides are useful, for example, for amplification of hCdc5 sequences, 
whether genomic or cDNA, and in diagnostic assays. 

[0061] "Wild-type" or "unaffected" hCdc5 nucleic acids are identical to or substantially 
identical to the sequence of Figure 2D (SEQ ID NO: 11). These nucleic acids may comprise 
sequence polymorphisms, silent mutations, or other mutations which do not affect the function of 
the hCdc5 protein. 

[0062] The hCdc5 nucleic acids of the inventions are useful for diagnostic assays to 
detect, for example, mutations in the hCdc5 gene. Such mutations may be indicative of aberrant 
forms of hCdc5 which have lost the ability to regulate cell cycle progression. Such mutations 
may cause hyperproliferation, aberrant cell size, delayed entry into mitosis, apoptosis or other 
cell death, or other phenotype associated with a defect in cell cycle progression. Various formats 
for such assays are described in detail below. 

[0063] The hCdc5 nucleic acids of the invention are also useful in the therapy of 
pathologies such as diseases, syndromes, or other undesirable conditions resulting from defects 
in cell cycle progression. Such cell cycle defects may result from defects in the hCdc5 gene 
itself, in the regulation of expression of the hCdc5 gene, or in a step "downstream" of hCdc5 in 
the regulation of cell cycle progression through G2 and entry into mitosis. In an embodiment, 
nucleic acids of the invention encoding antagonists of hCdc5 are useful in preventing or delaying 
the progression of gametes through G2 and entry into mitosis as a form of birth control. In other 
embodiments, nucleic acids of the invention encoding antagonists of hCdc5 are useful in the 
treatment of pathologies such as but not limited to hyperproliferative diseases such as cancer 
(e.g., leukemia, lymphoma, breast cancer, colon cancer, prostate cancer, Wilms' tumor), 
coronary artery disease, pulmonary vascular obstructive disease, either primary or as a feature of 
Eisenmenger's syndrome, and other disorders of abnormal cellular proliferation. Cells to be 
treated include but are not limited to hyperproliferative cells, cancer cells, vascular smooth 
muscle cells, endothelial cells, gametes, etc. 

[0064] In some embodiments of the invention, the hCdc5 nucleic acids of the invention 
encoding wild type protein or agonists of hCdc5 are used to stimulate, promote, or facilitate 
progression through the cell cycle, such as in the cellular regeneration of terminally 
differentiated cardiac myocytes or tissues, e.g., striated muscle myocytes. For example, this 
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could allow restoration of damaged myocardium after cardiac injury, myocardial infarction, 
myocarditis, cardiomyopathy, trauma, as a consequence of cardiac surgery, etc., or repletion of 
striated muscle exhausted by muscular dystrophy. 

[0065] The hCdc5 nucleic acids are typically delivered to cells by the methods 
5 described in more detail below. In some embodiments, expression of the polypeptides encoded 
by the nucleic acids is expected to prevent, ameliorate, or lessen the cell cycle defect of the host 
cell, or to restore normal cell cycle progression of the host cell. Whether provided via nucleic 
acid or polypeptides delivered directly to cells, the therapeutic formulations of the invention can 
also be used as adjuncts to other forms of therapy, including but not limited to chemotherapy, 
10 radiation therapy, and so on. 

[0066] Such nucleic acids may encode the wild-type, or unaffected gene product, or 
ri may encode antagonists or agonists of hCdc5 gene in the host cell. An antagonist of hCdc5 is an 
IH agent which delays or prevents progression through G2 or entry into mitosis. An agonist of 
f\ hCdc5 is an agent which promotes or facilitates progression through G2 and entry into mitosis. 
f|5 A therapeutic dose of an antagonist of the invention is a dose sufficient to prevent, ameliorate, or 
delay a host cell's progression through G2 or entry into mitosis. A therapeutic dose of an agonist 
Tl of the invention is a dose sufficient to promote or facilitate host cell progression through G2 and 
■ff entry into mitosis. 

□ [0067] In some embodiments, antisense antagonists of hCdc5 expression are provided. 

"*20 For a review of the design considerations and use of antisense oligonucleotides, see Uhlmann et 
al. Chemical Reviews 90:543-584 (1990) the disclosure of which is hereby incorporated by 
reference. The antisense oligonucleotides of the present invention may be synthesized by any of 
the known chemical oligonucleotide synthesis methods. Such methods are generally described, 
for example, in Winnacker From Genes to Clones: Introduction to Gene Technology, VCH 
25 Verlagsgesellschaft mhH (H. Ibelgaufts trans. 1987). Antisense oligonucleotides are 
advantageously prepared by utilizing any of the commercially available, automated nucleic acid 
synthesizers. One such device, the Applied Biosystems 380B DNA Synthesizer, utilizes p- 
cyanoethyl phosphoramidite chemistry. 

[0068] Antisense oligonucleotides hybridizable with any portion of a hCdc5 gene may 
30 be prepared by oligonucleotide synthesis methods known to those skilled in the art. While any 
length oligonucleotide may be utilized in the practice of the invention, sequences shorter than 12 
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bases may be less specific in hybridizing to the target mRNA, may be more easily destroyed by 
enzymatic digestion, and may be destabilized by enzymatic digestion. Hence, oligonucleotides 
having 12 or more nucleotides are preferred. Long sequences, particularly sequences longer than 
about 40 nucleotides, may be somewhat less effective in inhibiting translation because of 
5 decreased uptake by the target cell. Thus, oligomers of 12-40 nucleotides are preferred, more 
preferably 15-30 nucleotides, most preferably 18-26 nucleotides. Sequences of 18-24 
nucleotides are most particularly preferred. 

[0069] In an embodiment of the invention, an hCdc5 polypeptide is provided to a host 
cell subject to overexpression in that host cell; i.e., the amount of hCdc5 produced in the host cell 
10 is substantially greater than the typical endogenous level in a host cell. Such overexpression can 
H; lead to the death of the host cell. Thus, for example, delivery of the hCdc5 nucleic acid to a host 
cell operably linked to a high level promoter is desired for the therapy of hyperproliferative 
Ul disease such as cancer. The hCdc5 can be wild-type or unaffected, or mutant, with the 
r i requirement that such overexpression result in cell death. Similarly, excess amounts, i.e., greater 
5 f 5 than normal physiological levels of hCdc5 polypeptides can be administered to host cells to 
si result in the death of those cells. 

Q hCdc5 Binding Site Nucleic Acids 

H [0070] The invention also provides hCdc5 binding site nucleic acids. The hCdc5 

|O |0 binding site nucleic acids include nucleic acid comprising a sequence that is specifically bound 
by the hCdc5 polypeptide, in particular by the DNA binding domain of the hCdc5 polypeptide. 
By "specifically bound" or "specific binding," with respect to hCdc5 binding site nucleic acid- 
hCdc5 polypeptide interaction, is intended that the polypeptide and the DNA binding site interact 
with a dissociation equilibrium constant of 10" 7 M or lower. In preferred embodiments, the 
25 dissociation equilibrium contant is 10" M or lower. 

[0071] The hCdc5 binding site nucleic acid can have the sequence of a naturally 
occurring hCdc5 binding site or can have a synthetic, empirically determined sequence. 
Preferably, the hCdc5 binding site nucleic acid is double-stranded and comprises a core sequence 
of "ANCA" (where N is any nucleotide, which core sequence is flanked within 3 bases by a 
30 palindromic sequence. By "within 3 bases" is meant that there are no more than 3 bases between 
the "A" residues at each end of the core sequence and the palindromic sequence. By 
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"palindromic sequence" is intended that the sequence of one side of the palindrome is identical 
to the complement of the other side; for example, the sequence "CGGTACCG" forms a 
palindrome having four bases on each side around the central "TA" sequence, the sequence 
"CTGAAACAG" forms a palindrome having three bases on each side around the central 
"AAA". The palindromic sequence is at least 3 bases in length and may be longer. By a 
palindromic sequence of at least three bases is meant that each side of the palindrome is at least 
three bases in length. The palindromic sequence can include the bases of the core sequence, that 
is the palindromic sequence can overlap the core sequence. Typically, the palindromic sequence 
is an AT-rich sequence. By "an AT-rich sequence" is meant that 80% or more of the bases 
forming the palindromic sequence are either A or T. The core sequence is preferably AWCA 
where W is A or T. 

[0072] Particularly preferred embodiments of the hCdc5 binding site nucleic acid 
include those comprising one of the following sequences: 
GATTTAACATAA (SEQIDNO:13) 
TTAACATAA (SEQIDNO:14) 
AATAAAATCAAAATT (SEQIDNO:15) 
AAAGGGGAACACTTT (SEQ ID NO: 16) 
ATTTAACATAA (SEQ ID NO: 20) 
TATTTAACATAA (SEQIDNO:21) 
GCTTTAACATAA (SEQ ID NO:22) or 
GATTTACCATAA (SEQ ID NO:27) 

[0073] The hCdc5 binding site nucleic acid is useful in a method for identifying 
compounds that act as effectors of hCdc5 regulation. The effectors can be inhibitors of the 
hCdc5 protein-DNA binding site binding interaction or they can be enhancers of the binding 
interaction. An inhibitor compound of the binding interaction will, in general, be an antagonist 
of hCdc5 as described above. An enhancer compound of the binding interaction will, in general, 
be an agonist of hCdc5 as described above. The method of the present invention for identifying 
an inhibitor compound of hCdc5 binding to a DNA binding site comprises (a) contacting a 
hCdc5 polypeptide with a hCdc5 binding site nucleic acid under conditions in which the hCdc5 
polypeptide binds to the hCdc5 binding site nucleic acid, wherein said hCdc5 polypeptide is 
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selected from the group consisting of a polypeptide having the amino acid sequence of SEQ ID 
NO:l, a biologically active fragment thereof, and a variant thereof, and wherein said hCdc5 
binding site nucleic acid comprises nucleic acid having the sequence ANCA flanked within 3 
bases by palindromic sequence; (b) determining the affinity of the binding between said hCdc5 
5 polypeptide and the hCdc5 binding site nucleic acid, (c) carrying out step (a) in the presence of a 
compound to be tested, (d) determining the affinity of the binding between said hCdc5 
polypeptide and the hCdc5 binding site nucleic acid in the presence of said compound, and (e) 
selecting a compound for which the affinity of binding determined in (b) is greater than the 
affinity of binding determined in (d). 
10 [0074] The method of the present invention for identifying an enhancer compound of 

O hCdc5 binding to a DNA binding site comprises (a) contacting a hCdc5 polypeptide with a 
JrJ hCdc5 binding site nucleic acid under conditions in which the hCdc5 polypeptide binds to the 
hCdc5 binding site nucleic acid, wherein said hCdc5 polypeptide is selected from the group 
□ consisting of a polypeptide having the amino acid sequence of SEQ ID NO:l, a biologically 
135 active fragment thereof, and a variant thereof, and wherein said hCdc5 binding site nucleic acid 
^ comprises nucleic acid having the sequence ANCA flanked within 3 bases by palindromic 
f~I sequence; (b) determining the affinity of the binding between said hCdc5 polypeptide and the 
^ hCdc5 binding site nucleic acid, (c) carrying out step (a) in the presence of a compound to be 
O tested, (d) determining the affinity of the binding between said hCdc5 polypeptide and the hCdc5 
20 binding site nucleic acid in the presence of said compound, and (e) selecting a compound for 
which the affinity of binding determined in (b) is less than the affinity of binding determined in 
(d). 

[0075] When used in connection with the method for identifying an inhibitor or 
enhancer of hCdc5-DNA binding, a "biologically active fragment" of an hCdc5 polypeptide is a 

25 polypeptide that comprises the DNA binding domain of hCdc5 and is capable of specifically 
binding to the hCdc5 binding site nucleic acid. Preferably, a biologically active fragment of 
hCdc5 comprises amino acids 1-120 of SEQ ID NO: 1, more preferably a biologically active 
fragment of hCdc5 comprises amino acids 1-500 of SEQ ID NO: 1. When used in connection 
with the method for identifying an inhibitor or enhancer of hCdc5-DNA binding, a "variant" of 

30 an hCdc5 polypeptide is a polypeptide that has substantial sequence identity to SEQ ID NO: 1, or 
a fragment thereof, and is capable of specifically binding to the hCdc5 binding site nucleic acid. 
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"Affinity" of binding is used in the conventional sense and intends the strength of the protein- 
nucleic acid binding interaction as measured by any of a number of standard parameters, for 
example, the dissociation equilibrium constant. It will be recognized that, depending upon the 
parameter used to quantitate the affinity of binding, an interaction of greater affinity may have a 
5 higher or a lower numerical value for the measured parameter. 

[0076] Typically, the methods of the present invention are carried out as follows. The 
hCdc5 polypeptide and the hCdc5 binding site nucleic acid are incubated together in the presence 
or absence of a compound to be tested as an effector in a binding buffer (for example, 1 0 mM 
HEPES, pH 7.8, 75 mM KC1, 2.5 mM MgCl 2 , 3% Ficoll, 40 ug/ml poly(dl-dC)), in the presence 
10 of a protease inhibitor (for example, 1 fig/ml phenylmethylsulfonyl fluoride, 0.4 ng/ml aprotinin, 

□ 1 ng/ml leupeptin, 0.4 ng/ml soybean trypsin inhibitor, 0.4 ng/ml pepstatin A, and/or 10 ng/ml 
benzamidine), at room temperature for 30 min. Typically, the hCdc5 binding site nucleic acid 

Ul will be detectably labeled (for example, with a radioactive label). Free nucleic acid can be 
q separated from that bound to the hCdc5 polypeptide by any method known in the art, for 
"|35 example, by non-denaturing acrylamide gel electrophoresis. The amount of nucleic acid present 
^ in each of the bound and the free fractions is quantitated by techniques well known in the art. 
i2 Equilibrium dissociation constants (K d ) can be determined by incubating a constant amount of 
•ff labeled nucleic acid (for example, 1 nM) with increasing amounts of polypeptide (for example, 

□ 2.5-40 nM) at room temperature for 30 minutes. After binding activity was quantitated as 
! ~20 described above, the Kd is calculated with the equation AT^=[D][P]/[DP], where [D] is the 

concentration of free DNA, [P] is the concentration of free protein, and [DP] is the concentration 
of the DNA-protein complex. The DNA concentration is made limiting relative to the 
polypeptide concentration to allow the approximation [DP]=[DP to tai]- 

[0077] Compounds identified as inhibitors of the binding interaction of hCdc5 

25 polypeptide and the hCdc5 binding site nucleic acid are useful as antagonists of hCdc5 and can 
be used in the methods described herein for antagonists generally. In particular, inhibitor 
compounds of hCdc5 binding to a DNA binding site can be used in a method for treating a cell 
cycle defect in a patient. Such cell cycle defects may result from defects in the hCdc5 gene 
itself, in the regulation of expression of the hCdc5 gene, or in a step "downstream" of hCdc5 in 

30 the regulation of cell cycle progression through G2 and entry into mitosis. In an embodiment, 
antagonists of hCdc5, such as the inhibitor compounds, are useful in preventing or delaying the 
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progression of gametes through G2 and entry into mitosis as a form of birth control. In other 
embodiments, antagonists of hCdc5 are useful in the treatment of pathologies such as but not 
limited to hyperproliferative diseases such as cancer (e.g., leukemia, lymphoma, breast cancer, 
colon cancer, prostate cancer, Wilms' tumor), coronary artery disease, pulmonary vascular 
obstructive disease, either primary or as a feature of Eisenmenger's syndrome, and other 
disorders of abnormal cellular proliferation. Cells to be treated include but are not limited to 
hyperproliferative cells, cancer cells, vascular smooth muscle cells, endothelial cells, gametes, 
etc. Inhibitor compounds of hCdc5 binding to a DNA binding site are also useful in a method of 
regulating the progression of a cell cycle through G2 and into mitosis. 

[0078] Compounds identified as enhancers of the binding interaction of hCdc5 
polypeptide and the hCdc5 binding site nucleic acid are useful as agonists of hCdc5 and can be 
used in the methods described herein for agonists generally. In particular, enhancer compounds 
of hCdc5 binding to a DNA binding site can be used in a method for promoting cell division in 
cells of normally non-regenerable tissue, for instance, in terminally differentiated cardiac 
myocytes or tissues, e.g, striated muscle myocytes. 

[0079] The hCdc5 binding site nucleic acid is also useful in connection with a hCdc5 
polypeptide to regulate expression of an operably linked DNA sequence. Vectors comprising 
DNA sequence encoding a protein of interest can be operably linked to hCdc5 binding site 
nucleic acid can be prepared by techniques that are routine in the art, for example, as described 
in Sambrook et al. One or more copies of the hCdc5 binding site nucleic acid can be operably 
linked to a DNA sequence encoding a protein of interest. The protein of interest can be any 
desirable protein. In particular embodiments the protein of interest is a reporter protein. Such 
vectors are useful in the method of the present invention to detect the expression of hCdc5 in a 
cell. In such a method, the vector is introduced into the cell and the expression of the reporter 
protein indicates the presence of hCdc5 in the cell. The vectors are also useful in the methods of 
the present invention to identify compounds that affect the interaction of hCdc5 with the hCdc5 
binding site nucleic acid. The vector can be introduced into host cells by any of a number of 
methods well known in the art. Expression of the protein of interest in the host cell can be 
regulated by the presence or absence of hCdc5 polypeptide. The hCdc5 polypeptide may be 
endogenous to the host cell or can be provided exogenously, for example, by co-introduction of a 
vector comprising an hCdc5 nucleic acid encoding a hCdc5 polypeptide of the invention. 
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Polypeptides of the Invention 

[0080] The polypeptides of the invention include the polypeptides encoded by the 
nucleic acid of Figure 2D (SEQ ID NO:l 1), amino acid substitutions thereof which do not affect 
function of the hCdc5 protein, biologically active fragments thereof, and mutants or variants 
5 thereof which can serve as agonists or antagonists of hCdc5. 

[0081] The polypeptides of the invention are useful, for example, in the therapy of 
defects in cell cycle progression as discussed above and in screening for and isolation of co- 
activators, accessory proteins, or targets of the hCdc5 protein. In some embodiments, 
administration of a polypeptide, for example, is expected to prevent, ameliorate, or lessen the 
10 defect in cell cycle progression of the host cell, or to restore normal cell cycle progression of the 
G host cell. Such polypeptides may comprise the full length wild-type, or unaffected gene product, 
C\ fragments thereof, or agonists of hCdc5 in the host cell. Antagonists and agonists are discussed 
W above, as are exemplary indications for treatment with the polypeptides of the invention. 
q [0082] The polypeptides of the invention are also useful for the generation of and 

S JJ5 screening of antibodies. Such antibodies are useful, for example, as antagonists of hCdc5 
protein, in the purification of the polypeptides of the invention, and in diagnostic assays for 
i2 hCdc5 protein. 

O Agonists and Antagonists of hCdc5 

' 20 [0083] As discussed above, in some embodiments of the invention, antagonists and 

agonists of hCdc5 are provided. Such antagonists or agonists include but are not limited to 
antibodies which specifically bind to hCdc5; antibodies which specifically bind to an hCdc5 co- 
activator, accessory protein, or target; mutants of hCdc5 protein; antisense hCdc5 nucleic acids; 
co-activators or accessory proteins for hCdc5; and peptide, non-peptide, and peptidomimetic 
25 analogs of such co-activators, accessory proteins, or targets of the hCdc5 polypeptide; and 
effector compounds that are inhibitors or enhancers of the hCdc5 interaction with a DNA binding 
site. 

[0084] Thus, for example, an exemplary antagonist of hCdc5 is a dominant negative 
hCdc5 mutant such as disclosed in the experimental examples. In general, it is expected that 
30 hCdc5 mutants which retain the ability to bind to a ligand such as a target promoter but which 
lack other function(s) which stimulates, promotes, or facilitates progression though G2 and entry 
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into mitosis can serve as antagonists. Thus, for example, deletion mutants which can still bind a 
ligand such as a promoter are expected to serve as antagonists of hCdc5. Point mutations, such 
as amino acid substitutions, are also useful, such as in the substitution of an amino acid otherwise 
subject to phosphorylation. An improperly phosphorylated protein would be expected to retain 
5 its ability to bind to a ligand such as a promoter, but would not provide some other function 
relevant to the progression though G2 or entry into mitosis. These mutants would thus compete 
with endogenous hCdc5 in the host cell for natural ligands. 

[0085] Agonists of hCdc5 include the wild-type hCdc5 polypeptide and may also 
comprise mutants of hCdc5. For example, constitutively active mutants which do not require 
1 0 phosphorylation for activation may compete successfully in the cell with the endogenous hCdc5. 

^ Ligands of hCdc5 

if! [0086] As used herein, "ligand" means a molecule that is recognized by a particular 

O protein. The agent bound by or reacting with the protein is called a "ligand", a term which is 
%5 definitionally meaningful only in terms of its counterpart protein. The term "ligand" does not 
- imply any particular molecular size or other structural or compositional feature other than that 
U= the substance in question is capable of binding or otherwise interacting with the protein. Also, a 
S "ligand" may serve either as the natural ligand to which the hCdc5 polypeptide binds or interacts, 
O or as a functional analogue that may act as an agonist or antagonist. Thus, the present invention 
' 20 provides methods for the affinity purification of ligands that interact with the hCdc5. Ligands 
that can be investigated by this invention include but are not restricted to, agonists and 
antagonists, toxins and venoms, viral epitopes, hormones, sugars, cofactors, peptides, enzyme 
substrates, cofactors, drugs, and proteins. 

[0087] Furthermore, peptide and protein ligands can be obtained by "panning" peptide 
25 or protein libraries displayed on filamentous bacteriophage against immobilized hCdc5 protein. 
(For examples of techniques, see Roberts et al. (1992) Gene 121: 9-15; Dennis et al. (1995) J 
Biol Chem 270: 25411-25417; Wang et al. (1995) J Biol Chem 270: 12250-12256). Briefly, 
bound phage are eluted at low pH and amplified in host cells from which the hCdc5-binding 
peptide sequences can be recovered by PCR. Ligands may also be identified using the yeast two- 
30 hybrid system (Fields, S. et al., Nature 340:245-247 (1989)). The hCdc5-binding peptides can 
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then be tested for their ability to inhibit an hCdc5 activity, such as progression through G2 and 
entry into mitosis, binding to a promoter sequence, and so on. 

[0088] Nucleic acid binding sites, that is, nucleic acid sequences that are recognized 
and specifically bound by the hCdc5 polypeptide or by a biologically active fragment or variant 
5 thereof are included in the term ligand. The sequence may be a naturally occurring DNA 
sequence (e.g., a promoter regulated by hCdc5) or may be synthetic sequence. The synthetic 
sequence may be a functional analogue of a naturally occurring DNA binding site. Nucleic acid 
binding sites of hCdc5 may be selected by any of a number of ways that are known in the art or 
described herein. For example, synthetic oligonucleotides having a random sequence, or random 
10 fragment of genomic DNA, can be subjected to affinity purification using hCdc5 or biologically 
r-^ active fragments thereof. In particular, a hCdc5 fragment comprising the DNA binding domain 
■ c (amino acids 1-120) modified by the addition of His residues to the carboxyl or amino terminal 
i|] can be contacted with a pool of random sequence synthetic oligonucleotides. The His-tagged 
i=n polypeptide together with any bound nucleic acid is isolated using a Ni resin (e.g, Ni-NTA 
= ff 5 agarose) as described, for instance in Van Dyke et al (1992). Alternatively, random or selected 
Si DNA fragments, synthetic or naturally occurring, can be used in a yeast two-hybrid system in 
H combination with a hCdc5 fusion protein to identify DNA sequence that interact with hCdc5 in 
O vivo. 

l *lX) General Methods 

[0089] The nucleic acid compositions of this invention, whether RNA, cDNA, genomic 
DNA, or a hybrid of the various combinations, may be isolated from natural sources, including 
cloned DNA, or may be synthesized in vitro. The nucleic acids claimed may be present in 
transformed or transfected whole cells, in a transformed or transfected cell lysate, or in a 
25 partially purified or substantially pure form. 

[0090] Techniques for nucleic acid manipulation of the nucleic acid sequences of the 
invention such as subcloning nucleic acid sequences encoding polypeptides into expression 
vectors, labeling probes, DNA hybridization, and the like are described generally in Sambrook et 
al, Molecular Cloning - A Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor 
30 Laboratory, Cold Spring Harbor, New York, (1989), which is incorporated herein by reference. 
This manual is hereinafter referred to as "Sambrook et al." 
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[0091] There are various methods of isolating the nucleic acid sequences of the 
invention. For example, DNA is isolated from a genomic or cDNA library using labeled 
oligonucleotide probes having sequences complementary to the sequences disclosed herein. 
Such probes can be used directly in hybridization assays. Alternatively probes can be designed 
5 for use in amplification techniques such as PCR. 

[0092] To prepare a cDNA library, mRNA is isolated from tissue such as heart or 
pancreas, preferably a tissue wherein expression of the gene or gene family is likely to occur. 
cDNA is prepared from the mRNA and ligated into a recombinant vector. The vector is 
transfected into a recombinant host for propagation, screening and cloning. Methods for making 
10 and screening cDNA libraries are well known. See Gubler, U. and Hoffman, B.J. Gene 25:263- 
p 269 (1983) and Sambrooketal. 

^1 [0093] For a genomic library, the DNA is extracted from tissue and either mechanically 

Ul sheared or enzymatically digested to yield fragments of about 12-20kb. The fragments are then 
rj separated by gradient centrifugation from undesired sizes and are constructed in bacteriophage 
"tf5 lambda vectors. These vectors and phage are packaged in vitro, as described in Sambrook, et al. 

Recombinant phage are analyzed by plaque hybridization as described in Benton and Davis, 
jT Science 196:180-182 (1977). Colony hybridization is carried out as generally described in M. 

O Grunstein et al. Proc. Natl. Acad. Sci. USA. 72:3961-3965 (1975). 

03 

□ [0094] DNA of interest is identified in either cDNA or genomic libraries by its ability 

"20 to hybridize with nucleic acid probes, for example on Southern blots, and these DNA regions are 
isolated by standard methods familiar to those of skill in the art. See Sambrook, et al. DNA 
binding sites for hCdc5 can be identified by the ability to specifically bind the hCdc5 
polypeptide. 

[0095] Various methods of amplifying target sequences, such as the polymerase chain 
25 reaction, can also be used to prepare DNA encoding hCdc5 protein. Polymerase chain reaction 
(PCR) technology is used to amplify such nucleic acid sequences directly from mRNA, from 
cDNA, and from genomic libraries or cDNA libraries. The isolated sequences encoding hCdc5 
protein may also be used as templates for PCR amplification. 

[0096] In PCR techniques, oligonucleotide primers complementary to the two 3' 
30 borders of the DNA region to be amplified are synthesized. The polymerase chain reaction is 
then carried out using the two primers. See PCR Protocols: A Guide to Methods and 
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Applications (Innis, M, Gelfand, D., Sninsky, J. and White, T., eds.), Academic Press, San Diego 
(1990). Primers can be selected to amplify the entire regions encoding a full-length sequence of 
interest or to amplify smaller DNA segments as desired. 

[0097] PCR can be used in a variety of protocols to isolate cDNA's encoding a 
5 sequence of interest. In these protocols, appropriate primers and probes for amplifying DNA 
encoding a sequence of interest are generated from analysis of the DNA sequences listed herein. 
Once such regions are PCR-amplified, they can be sequenced and oligonucleotide probes can be 
prepared from the sequence obtained. 

[0098] Oligonucleotides for use as probes are chemically synthesized according to the 
10 solid phase phosphoramidite triester method first described by Beaucage, S.L. and Carruthers, 
r 3 M.H., Tetrahedron Lett., 22(20) :1859-1862 (1981) using an automated synthesizer, as described 
fl in Needham-VanDevanter, D.R., et al, Nucleic Acids Res. 12:6159-6168 (1984). Purification 
Iff of oligonucleotides is by either native acrylamide gel electrophoresis or by anion-exchange 
^ HPLC as described in Pearson, J.D. and Regnier, F.E., J. Chrom., 255:137-149 (1983). The 
ft 5 sequence of the synthetic oligonucleotide can be verified using the chemical degradation method 
of Maxam, A.M. and Gilbert, W., in Grossman, L. and Moldave, D., eds. Academic Press, New 
H York, Methods in Enzymology 65:499-560 1980). 

q Expression 

^0 [0099] Once DNA encoding a sequence of interest is isolated and cloned, one can 

express the encoded proteins in a variety of recombinantly engineered cells. It is expected that 
those of skill in the art are knowledgeable in the numerous expression systems available for 
expression of DNA encoding a sequence of interest. No attempt to describe in detail the various 
methods known for the expression of proteins in prokaryotes or eukaryotes is made here. 

25 [00100] In brief summary, the expression of natural or synthetic nucleic acids encoding 

a sequence of interest will typically be achieved by operably linking the DNA or cDNA to a 
promoter (which is either constitutive or inducible), followed by incorporation into an expression 
vector. The vectors can be suitable for replication and integration in either prokaryotes or 
eukaryotes. Typical expression vectors contain transcription and translation terminators, 

30 initiation sequences, and promoters useful for regulation of the expression of polynucleotide 
sequence of interest. To obtain high level expression of a cloned gene, it is desirable to construct 
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expression plasmids which contain, at the minimum, a strong promoter to direct transcription, a 
ribosome binding site for translational initiation, and a transcription/translation terminator. The 
expression vectors may also comprise generic expression cassettes containing at least one 
independent terminator sequence, sequences permitting replication of the plasmid in both 
5 eukaryotes and prokaryotes, i.e., shuttle vectors, and selection markers for both prokaryotic and 
eukaryotic systems. See Sambrook et al. Examples of expression systems in both prokaryotic 
and eukaryotic systems are described below. 
Expression in Prokaryotes 

[00101] A variety of procaryotic expression systems may be used to express the 
10 polypeptides of the invention. Examples include E. coli, Bacillus, Streptomyces, and the like. 
n [00102] It is preferred to construct expression plasmids which contain, at the minimum, 

a strong promoter to direct transcription, a ribosome binding site for translational initiation, and a 
ij] transcription/translation terminator. Examples of regulatory regions suitable for this purpose in 
E. coli are the promoter and operator region of the E. coli tryptophan biosynthetic pathway as 
=J 5 described by Yanofsky, C, J. Bacteriol. 158:1018-1024 (1984) and the leftward promoter of 
r phage lambda (PA.) as described by Herskowitz, I. and Hagen, D., Ann. Rev. Genet. 14:399-445 
XI (1980). The inclusion of selection markers in DNA vectors transformed in E. coli is also useful. 
Q Examples of such markers include genes specifying resistance to ampicillin, tetracycline, or 
□ chloramphenicol. See Sambrook et al. for details concerning selection markers for use in E. coli. 
=""20 [00103] To enhance proper folding of the expressed recombinant protein, during 

purification from E. coli, the expressed protein may first be denatured and then renatured. This 
can be accomplished by solubilizing the bacterially produced proteins in a chaotropic agent such 
as guanidine HC1 and reducing all the cysteine residues with a reducing agent such as beta- 
mercaptoethanol. The protein is then renatured, either by slow dialysis or by gel filtration. See 
25 U.S. Patent No. 4,51 1,503. 

[00104] Detection of the expressed antigen is achieved by methods known in the art as 
radioimmunoassay, or Western blotting techniques or immunoprecipitation. Purification from 
E. coli can be achieved following procedures such as those described in U.S. Patent No. 
4,511,503. 
30 Expression in Eukaryotes 
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[00105] A variety of eukaryotic expression systems such as yeast, insect cell lines, bird, 
fish, and mammalian cells, are known to those of skill in the art. As explained briefly below, a 
sequence of interest may be expressed in these eukaryotic systems. 

[00106] Synthesis of heterologous proteins in yeast is well known. Methods in Yeast 
5 Genetics, Sherman, F., et al, Cold Spring Harbor Laboratory, (1982) is a well recognized work 
describing the various methods available to produce the protein in yeast. 

[00107] Suitable vectors usually have expression control sequences, such as promoters, 
including 3-phosphoglycerate kinase or other glycolytic enzymes, and an origin of replication, 
termination sequences and the like as desired. For instance, suitable vectors are described in the 
10 literature (Botstein, et al, Gene 8:17-24 (1979); Broach, et al, Gene 8:121-133 (1979)). 
^ [00108] Two procedures are used in transforming yeast cells. In one case, yeast cells 

*Q are first converted into protoplasts using zymolyase, lyticase or glusulase, followed by addition 
in of DNA and polyethylene glycol (PEG). The PEG-treated protoplasts are then regenerated in a 
,2 3% agar medium under selective conditions. Details of this procedure are given in the papers by 
«§5 J.D. Beggs, Nature (London) 275:104-109 (1978); and Hinnen, A., et al, Proc. Natl. Acad. Sci. 
r U.S.A. 75:1929-1933 (1978). The second procedure does not involve removal of the cell wall. 
H Instead the cells are treated with lithium chloride or acetate and PEG and put on selective plates 
O (Ito, H., et al., J. Bact. 153:163-168 (1983)). 

S [00109] The polypeptides of the invention, once expressed, can be isolated from yeast 

?> 20 by lysing the cells and applying standard protein isolation techniques to the lysates. The 
monitoring of the purification process can be accomplished by using Western blot techniques or 
radioimmunoassay of other standard immunoassay techniques. 

[00110] The sequences encoding the polypeptides of the invention can also be ligated 
to various expression vectors for use in transforming cell cultures of, for instance, mammalian, 
25 insect, bird or fish origin. Illustrative of cell cultures useful for the production of the 
polypeptides are mammalian cells. Mammalian cell systems often will be in the form of 
monolayers of cells although mammalian cell suspensions may also be used. A number of 
suitable host cell lines capable of expressing intact proteins have been developed in the art, and 
include the HEK293, BHK21, and CHO cell lines, and various human cells such as COS cell 
30 lines, HeLa cells, myeloma cell lines, Jurkat cells, etc. Expression vectors for these cells can 
include expression control sequences, such as an origin of replication, a promoter (e.g., the CMV 
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promoter, a HSV tk promoter or pgk (phosphoglycerate kinase) promoter), an enhancer (Queen 
et al. Immunol. Rev. 89:49 (1986)), and necessary processing information sites, such as ribosome 
binding sites, RNA splice sites, polyadenylation sites (e.g., an SV40 large TAg poly A addition 
site), and transcriptional terminator sequences. Other animal cells useful for production of 
5 hCdc5 proteins are available, for instance, from the American Type Culture Collection Catalogue 
of Cell Lines and Hybridomas (7th edition, (1992)). 

[00111] Appropriate vectors for expressing the polypeptides of the invention in insect 
cells are usually derived from the SF9 baculovirus. Suitable insect cell lines include mosquito 
larvae, silkworm, armyworm, moth and Drosophila cell lines such as a Schneider cell line (See 
10 Schneider J. Embryol. Exp. Morphol. 27:353-365 (1987). 

5=5: [00112] As indicated above, the vector, e.g., a plasmid, which is used to transform the 

^ host cell, preferably contains DNA sequences to initiate transcription and sequences to control 
1 11 the translation of the protein. These sequences are referred to as expression control sequences. 
;4 [00113] As with yeast, when higher animal host cells are employed, polyadenylation or 

=#5 transcription terminator sequences from known mammalian genes need to be incorporated into 
the vector. An example of a terminator sequence is the polyadenylation sequence from the 
J~f bovine growth hormone gene. Sequences for accurate splicing of the transcript may also be 
O included. An example of a splicing sequence is the VP1 intron from SV40 (Sprague, J. et al., 
p J. Virol. 45: 773-781 (1983)). 

r 20 [00114] Additionally, gene sequences to control replication in the host cell may be 

incorporated into the vector such as those found in bovine papilloma virus type-vectors. Saveria- 
Campo, M., 1985, "Bovine Papilloma virus DNA a Eukaryotic Cloning Vector" in DNA Cloning 
vol. II a Practical Approach Ed. D.M. Glover, IRL Press, Arlington, Virginia pp. 213-238. 

[00115] The host cells are competent or rendered competent for transformation by 
25 various means. There are several well-known methods of introducing DNA into animal cells. 
These include: calcium phosphate precipitation, fusion of the recipient cells with bacterial 
protoplasts containing the DNA, treatment of the recipient cells with liposomes containing the 
DNA, DEAE dextran, electroporation and micro-injection of the DNA directly into the cells. 

[00116] The transformed cells are cultured by means well known in the art. 
30 Biochemical Methods in Cell Culture and Virology, Kuchler, R.J., Dowden, Hutchinson and 
Ross, Inc., (1977). The expressed polypeptides are isolated from cells grown as suspensions or 
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as monolayers. The latter are recovered by well known mechanical, chemical or enzymatic 
means. 

Purification 

5 [00117] The proteins produced by recombinant DNA technology may be purified by 

standard techniques well known to those of skill in the art. Recombinantly produced proteins 
can be directly expressed or expressed as fusion proteins. The protein is then purified by a 
combination of cell lysis (e.g., sonication) and affinity chromatography. For fusion products, 
subsequent digestion of the fusion protein with an appropriate proteolytic enzyme releases the 
10 desired polypeptide. 

[00118] The polypeptides of this invention may be purified to substantial purity by 
standard techniques well known in the art, including selective precipitation with such substances 
li! as ammonium sulfate, column chromatography, immunopurification methods, and others. See, 
for instance, R. Scopes, Protein Purification: Principles and Practice, Springer- Verlag: New York 
|15 (1982), incorporated herein by reference. For example, antibodies may be raised to the proteins 
i-. of the invention as described herein. Cell membranes are isolated from a cell line expressing the 
|T recombinant protein, the protein is extracted from the membranes and immunoprecipitated. The 
W proteins may then be further purified by standard protein chemistry techniques as described 
Q above. 
^0 

In Vitro Diagnostic Methods 

[00119] The present invention provides, inter alia, methods for detecting DNA or RNA 
encoding the proteins of the invention and for measuring the proteins by immunoassay 
techniques. These methods are useful for two general purposes. First, assays for detection of 
25 nucleic acids encoding the proteins of the invention are useful for the isolation of these nucleic 
acids from a variety of vertebrate species according to the methods described in section (B) 
above and by use of the nucleic acid hybridization assays described below. 

[00120] The nucleic acid hybridization assays and the immunoassays described below 
are also useful as in vitro diagnostic assays for disorders in which alterations in the proteins of 
30 the invention or related proteins play a role. 

Nucleic Acid-Based Diagnostic Assays 
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[00121] A variety of methods for specific DNA and RNA measurement using nucleic 
acid hybridization techniques are known to those of skill in the art. See Sambrook, et al. For 
example, one method for evaluating the presence or absence of the nucleic acids of the invention 
in a sample involves a Southern transfer. Briefly, the digested genomic DNA is run on agarose 
5 slab gels in buffer and transferred to membranes. Hybridization is carried out using the nucleic 
acid probes discussed above. As described above, nucleic acid probes are designed based on the 
nucleic acid sequences of the invention. The probes can be full length or less than the full length 
of the nucleic acid sequence encoding the hCdc5 protein. Shorter probes are empirically tested 
for specificity. Preferably nucleic acid probes are 20 bases or longer in length. (See Sambrook, 
10 et al. for methods of selecting nucleic acid probe sequences for use in nucleic acid 
i-n- hybridization.) Visualization of the hybridized portions allows the qualitative determination of 

the presence or absence of DNA encoding hCdc5 proteins. 
m [00122] Similarly, a Northern transfer may be used for the detection of mRNA 

=4 encoding the proteins of the invention. In brief, the mRNA is isolated from a given cell sample 
fl 5 using an acid guanidinium-phenol-chloroforrn extraction method. The mRNA is then 
electrophoresed to separate the mRNA species and the mRNA is transferred from the gel to a 
n. nitrocellulose membrane. As with the Southern blots, labeled probes are used to identify the 
^ presence or absence of the proteins of the invention 

□ [00123] A variety of nucleic acid hybridization formats are known to those skilled in 

'"70 the art. For example, common formats include sandwich assays and competition or displacement 

assays. Hybridization techniques are generally described in Nucleic Acid Hybridization. A 

Practical Approach Ed. Hames, B.D. and Higgins, S.J., IRL Press (1985); Gall and Pardue Proc. 

Natl. Acad. Sci. U.S.A. 63:378-383 (1969); and John, Burnsteil and Jones Nature 223:582-587 

(1969). 

25 [00124] For example, sandwich assays are commercially useful hybridization assays for 

detecting or isolating nucleic acid sequences. Such assays utilize a "capture" nucleic acid 
covalently immobilized to a solid support and a labeled "signal" nucleic acid in solution. The 
clinical sample will provide the target nucleic acid. The "capture" nucleic acid and "signal" 
nucleic acid probe hybridize with the target nucleic acid to form a "sandwich" hybridization 

30 complex. To be effective, the signal nucleic acid cannot hybridize with the capture nucleic acid. 
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[00125] Typically, labeled signal nucleic acids are used to detect hybridization. 
Complementary nucleic acids or signal nucleic acids may be labeled by any one of several 
methods typically used to detect the presence of hybridized polynucleotides. The most common 
method of detection is the use of autoradiography with 3 H, l25 1, 35 S, 14 C, or 32 P-labeled probes or 
5 the like. Other labels include ligands which bind to labeled antibodies, fluorophores, 
chemiluminescent agents, enzymes, and antibodies which can serve as specific binding pair 
members for a labeled ligand. 

[00126] Detection of a hybridization complex may require the binding of a signal 
generating complex to a duplex of target and probe polynucleotides or nucleic acids. Typically, 
10 such binding occurs through ligand and anti-ligand interactions as between a ligand-conjugated 
l«v probe and an anti-ligand conjugated with a signal. 

^ [00127] The label may also allow indirect detection of the hybridization complex. For 

111 example, where the label is a hapten or antigen, the sample can be detected by using antibodies. 
i4 In these systems, a signal is generated by attaching fluorescent or enzyme molecules to the 
■H5 antibodies or, in some cases, by attachment to a radioactive label. (Tijssen, P., "Practice and 
S : Theory of Enzyme Immunoassays," Laboratory Techniques in Biochemistry and Molecular 
J~; Biology Burdon, R.H., van Knippenberg, P.H., Eds., Elsevier (1985), pp. 9-20.) 
1=3 [00128] The sensitivity of the hybridization assays may be enhanced through use of a 

O nucleic acid amplification system which multiplies the target nucleic acid being detected. 
; i20 Examples of such systems include the polymerase chain reaction (PCR) system and the ligase 
chain reaction (LCR) system. Other methods recently described in the art are the nucleic acid 
sequence based amplification (NASBA™, Cangene, Mississauga, Ontario) and Q Beta Replicase 
systems. 

[00129] An alternative means for determining the level of expression of a gene 
25 encoding the proteins of the invention is in situ hybridization. In situ hybridization assays are 
well known and are generally described in Angerer et al. Methods Enzymol. 152:649-660 
(1987). In an in situ hybridization assay, cells are fixed to a solid support, typically a glass slide. 
If DNA is to be probed, the cells are denatured with heat or alkali. The cells are then contacted 
with a hybridization solution at a moderate temperature to permit annealing of labeled probes 
30 specific to the proteins of the invention. The probes are preferably labeled with radioisotopes or 
fluorescent reporters. 
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[00130] Methods of screening nucleic acid for mutations are well known in the art, 
including, but not limited to, restriction-fragment-length-polymorphism detection based on 
allele-specific restriction-endonuclease cleavage (Kan and Dozy Lancet ii:910-912 (1978)), 
hybridization with allele-specific oligonucleotide probes (Wallace et al. Nucl Acids Res 6:3543- 
5 3557 (1978)), including immobilized oligonucleotides (Saiki et al. Proc. Natl. Acad. Sci. U.S.A. 
86:6230-6234 (1989)) or oligonucleotide arrays (Maskos and Southern Nucl Acids Res 21:2269- 
2270 (1993)), allele-specific PCR (Newton et al. Nucl Acids Res 17:2503-2516 (1989)), 
mismatch-repair detection (MRD) (Faham and Cox Genome Res 5:474-482 (1995)), binding of 
MutS protein (Wagner et al. Nucl Acids Res 23:3944-3948 (1995), denaturing-gradient gel 
10 electrophoresis (DGGE) (Fisher and Lerman et al. Proc. Natl. Acad. Sci. U.S.A. 80:1579-1583 
(1983)), single-strand-conformation-polymorphism detection (Orita et al. Genomics 5:874-879 
(1983)), RNAase cleavage at mismatched base-pairs (Myers et al. Science 230:1242 (1985)), 
SJ1 chemical (Cotton et al. Proc. Natl. Acad. Sci. U.S.A. 85:4397-4401 (1988)) or enzymatic (Youil 
q et al. Proc. Natl. Acad. Sci. U.S.A. 92:87-91 (1995)) cleavage of heteroduplex DNA, methods 
[15 based on allele specific primer extension (Syvanen et al. Genomics 8:684-692 (1990)), genetic 
»! bit analysis (GBA) (Nikiforov et al. Nucl Acids Res 22:4167-4175 (1994)), the oligonucleotide- 
l2 ligation assay (OLA) (Landegren et al. Science 241:1077 (1988)), the allele-specific ligation 

g chain reaction (LCR) (Barrany Proc. Natl. Acad. Sci. U.S.A. 88:189-193 (1991)), gap-LCR 

ill 

O (Abravaya et al. Nucl Acids Res 23:675-682 (1995)), and radioactive and/or fluorescent DNA 

20 sequencing using standard procedures well known in the art. 

Production of Antibodies and Development of Immunoassays 

[00131] Immunoassays can be used to qualitatively or quantitatively analyze for the 
proteins of the invention. A general overview of the applicable technology can be found in 
25 Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Pubs., N.Y. (1988), 
incorporated herein by reference. 
Antibody Production 

[00132] A number of immunogens may be used to produce antibodies specifically 
reactive with the proteins of the invention. Recombinant protein is the preferred immunogen for 
30 the production of monoclonal or polyclonal antibodies. Naturally occurring protein may also be 
used either in pure or impure form. Synthetic peptides made using the protein sequences 
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described herein may also used as an immunogen for the production of antibodies to the protein. 
Recombinant protein can be expressed in eukaryotic or prokaryotic cells as described above, and 
purified as generally described above. The product is then injected into an animal capable of 
producing antibodies. Either monoclonal or polyclonal antibodies may be generated, for 
5 subsequent use in immunoassays to measure the protein. 

[00133] Methods of production of polyclonal antibodies are known to those of skill in 
the art. In brief, an immunogen, preferably a purified protein, is mixed with an adjuvant and 
animals are immunized. The animal's immune response to the immunogen preparation is 
monitored by taking test bleeds and determining the titer of reactivity to the hCdc5 protein. 
10 When appropriately high titers of antibody to the immunogen are obtained, blood is collected 
e% from the animal and antisera are prepared. Further fractionation of the antisera to enrich for 
;H antibodies reactive to the protein can be done if desired. (See Harlow and Lane, supra), 
m [00134] Monoclonal antibodies may be obtained by various techniques familiar to those 

;i skilled in the art. Briefly, spleen cells from an animal immunized with a desired antigen are 
"{35 immortalized, commonly by fusion with a myeloma cell (See, Kohler and Milstein, Eur. J. 

Immunol. 6:511-519 (1976), incorporated herein by reference). Alternative methods of 
'ri immortalization include transformation with Epstein Barr Virus, oncogenes, or retroviruses, or 
G other methods well known in the art. Colonies arising from single immortalized cells are 
q screened for production of antibodies of the desired specificity and affinity for the antigen, and 
K 20 yield of the monoclonal antibodies produced by such cells may be enhanced by various 
techniques, including injection into the peritoneal cavity of a vertebrate host. Alternatively, one 
may isolate DNA sequences which encode a monoclonal antibody or a binding fragment thereof 
by screening a DNA library from human B cells according to the general protocol outlined by 
Huse, etal. Science 246:1275-1281 (1989). 
25 [00135] Methods of production of synthetic peptides are known to those of skill in the 

art. Peptides preferably at least 1 0 amino acids in length are synthesized corresponding to these 
regions and the peptides are conjugated to larger protein molecules for subsequent immunization. 
Preferably, peptide sequences corresponding to regions of interest of a recombinant protein of 
the invention is used to generate antibodies specifically immunoreactive with the protein. 
30 Production of monoclonal or polyclonal antibodies is then carried out as described above. 
Immunoassays 
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[00136] A particular protein can be measured by a variety of immunoassay methods. 
For a review of immunological and immunoassay procedures in general, see Basic and Clinical 
Immunology 7th Edition (D. Stites and A. Terr ed.) (1991). Moreover, immunoassays can be 
performed in any of several configurations, which are reviewed extensively in Enzyme 
5 Immunoassay, E.T. Maggio, ed., CRC Press, Boca Raton, Florida (1980); "Practice and Theory 
of Enzyme Immunoassays," P. Tijssen, Laboratory Techniques in Biochemistry and Molecular 
Biology, Elsevier Science Publishers B.V. Amsterdam (1985); and, Harlow and Lane, 
Antibodies, A Laboratory Manual, supra, each of which is incorporated herein by reference. 

[00137] Immunoassays for measurement of the proteins of the invention can be 
10 performed by a variety of methods known to those skilled in the art. In brief, immunoassays to 
s -=» measure the protein can be either competitive or noncompetitive binding assays. In competitive 
'0 binding assays, the sample analyte competes with a labeled analyte for specific binding sites on a 
m capture agent bound to a solid surface. Preferably the capture agent is an antibody specifically 
=~ reactive with a recombinant protein of the invention produced as described above. The 
'Ii5 concentration of labeled analyte bound to the capture agent is inversely proportional to the 
Ei amount of free analyte present in the sample. 

\ J [00138] Western blot analysis can also be done to determine the presence of a protein 

O of the invention in a sample. Electrophoresis is carried out, for example, on a tissue sample 
n suspected of containing the protein. Following electrophoresis to separate the proteins, and 
^0 transfer of the proteins to a suitable solid support such as a nitrocellulose filter, the solid support 
is then incubated with an antibody reactive with the protein. This antibody may be labeled, or 
alternatively may be it may be detected by subsequent incubation with a second labeled antibody 
that binds the primary antibody. 

[00139] The immunoassay formats described above employ labeled assay components. 
25 The label can be in a variety of forms. The label may be coupled directly or indirectly to the 
desired component of the assay according to methods well known in the art. A wide variety of 
labels may be used. The component may be labeled by any one of several methods. 
Traditionally a radioactive label incorporating 3 H, 125 I, 35 S, 14 C, or 32 P was used. Non- 
radioactive labels include ligands which bind to labeled antibodies, fluorophores, 
30 chemiluminescent agents, enzymes, and antibodies which can serve as specific binding pair 
members for a labeled ligand. The choice of label depends on sensitivity required, ease of 
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conjugation with the compound, stability requirements, and available instrumentation. For a 
review of various labeling or signal producing systems which may be used, see U.S. Patent No. 
4,391,904, which is incorporated herein by reference. 

[00140] Antibodies reactive with a particular protein can also be measured by a variety 
5 of immunoassay methods. For a review of immunological and immunoassay procedures 
applicable to the measurement of antibodies by immunoassay techniques, see Basic and Clinical 
Immunology 7th Edition (D. Stites and A. Terr ed.) supra, Enzyme Immunoassay, E.T. Maggio, 
ed., supra, and Harlow and Lane, Antibodies, A Laboratory Manual, supra. 

10 Kits 

[00141] This invention also embraces kits for detecting the presence of a protein of the 
*D invention in tissue or blood samples which comprise a container containing antibodies 
Ifl selectively immunoreactive to the protein and instructional material for performing the test. The 
^ kit may also contain other components such as a protein of the invention, controls, buffer 
=fi 5 solutions, and secondary antibodies. Kits for detecting antibodies to a protein of the invention 
=, comprise a container containing an a protein of the invention, instructional material and may 
j s : comprise other materials such as secondary antibodies and labels as described herein. 
O [00142] This invention further embraces diagnostic kits for detecting DNA or RNA 

n encoding proteins of interest in tissue or blood samples which comprise nucleic acid probes as 
'i>0 described herein and instructional material. The kit may also contain additional components 
such as labeled compounds, as described herein, for identification of duplexed nucleic acids. 

[00143] This invention also includes kits for identifying effector compounds of hCdc5- 
DNA binding comprising hCdc5 binding site nucleic acid and hCdc5 polypeptide, a biologically 
active fragment or a variant thereof. 
25 [00144] This invention further contemplates kits for delivery of a therapeutic 

composition of the present invention to a mammal in need thereof. By way of illustration, in one 
embodiment, the kit comprises the therapeutic composition (e.g., an adenoviral vector containing 
a nucleic acid encoding hCdc5) in a pharmaceutically acceptable solution and a catheter for 
administration of the composition to a target organ, e.g., a heart. The kit may further comprise 
30 instructions for dosing and/or administration of the composition. 
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Formulations 

[00145] The compositions of the invention will be formulated for administration by 
manners known in the art acceptable for administration to a mammalian subject, preferably a 
human. In some embodiments of the invention, the compositions of the invention can be 
5 administered directly into a tissue by injection or into a blood vessel supplying the tissue of 
interest. In further embodiments of the invention the compositions of the invention are 
administered "locoregionally", i.e., intravesically, intralesionally, and/or topically. In other 
embodiments of the invention, the compositions of the invention are administered systemically 
by injection, inhalation, suppository, transdermal delivery, etc. In further embodiments of the 
10 invention, the compositions are administered through catheters or other devices to allow access 
i«-s to a remote tissue of interest, such as an internal organ. The compositions of the invention can 
also be administered in depot type devices, implants, or encapsulated formulations to allow slow 
I J] or sustained release of the compositions. 

A [00146] In order to administer therapeutic agents based on, or derived from, the present 

i.l 5 invention, it will be appreciated that suitable carriers, excipients, and other agents may be 

incorporated into the formulations to provide improved transfer, delivery, tolerance, and the like. 
; = : [00147] A multitude of appropriate formulations can be found in the formulary known 

O to all pharmaceutical chemists: Remington's Pharmaceutical Sciences, (15th Edition, Mack 
q Publishing Company, Easton, Pennsylvania (1975)), particularly Chapter 87, by Blaug, Seymour, 
-^0 therein. These formulations include for example, powders, pastes, ointments, jelly, waxes, oils, 

lipids, anhydrous absorption bases, oil-in-water or water-in-oil emulsions, emulsions carbowax 

(polyethylene glycols of a variety of molecular weights), semi-solid gels, and semi-solid 

mixtures containing carbowax. 

[00148] Any of the foregoing formulations may be appropriate in treatments and 
25 therapies in accordance with the present invention, provided that the active agent in the 

formulation is not inactivated by the formulation and the formulation is physiologically 

compatible. 

[00149] The quantities of active ingredient necessary for effective therapy will depend 
on many different factors, including means of administration, target site, physiological state of 
30 the patient, and other medicaments administered. Thus, treatment dosages should be titrated to 
optimize safety and efficacy. Typically, dosages used in vitro may provide useful guidance in 
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the amounts useful for in situ administration of the active ingredients. Animal testing of 
effective doses for treatment of particular disorders will provide further predictive indication of 
human dosage. Various considerations are described, for example, in Goodman and Gilman's 
the Pharmacological Basis of Therapeutics, 7th Edition (1985), MacMillan Publishing Company, 
5 New York, and Remington's Pharmaceutical Sciences 18th Edition, (1990) Mack Publishing Co, 
Easton Penn. Methods for administration are discussed therein, including oral, intravenous, 
intraperitoneal, intramuscular, transdermal, nasal, iontophoretic administration, and the like. 

[00150] The compositions of the invention may be administered in a variety of unit 
dosage forms depending on the method of administration. For example, unit dosage forms 
10 suitable for oral administration include solid dosage forms such as powder, tablets, pills, 
capsules, and dragees, and liquid dosage forms, such as elixirs, syrups, and suspensions. The 
*f active ingredients may also be administered parenterally in sterile liquid dosage forms. Gelatin 
In capsules contain the active ingredient and as inactive ingredients powdered carriers, such as 
i~k glucose, lactose, sucrose, mannitol, starch, cellulose or cellulose derivatives, magnesium 
s #i 5 stearate, stearic acid, sodium saccharin, talcum, magnesium carbonate and the like. Examples of 
additional inactive ingredients that may be added to provide desirable color, taste, stability, 
!*? buffering capacity, dispersion or other known desirable features are red iron oxide, silica gel, 
O sodium lauryl sulfate, titanium dioxide, edible white ink and the like. Similar diluents can be 
Ft used to make compressed tablets. Both tablets and capsules can be manufactured as sustained 
p ^0 release products to provide for continuous release of medication over a period of hours. 
Compressed tablets can be sugar coated or film coated to mask any unpleasant taste and protect 
the tablet from the atmosphere, or enteric-coated for selective disintegration in the 
gastrointestinal tract. Liquid dosage forms for oral administration can contain coloring and 
flavoring to increase patient acceptance. 
25 [00151] The concentration of the compositions of the invention in the pharmaceutical 

formulations can vary widely, i.e., from less than about 0.1%, usually at or at least about 2% to 
as much as 20% to 50% or more by weight, and will be selected primarily by fluid volumes, 
viscosities, etc., in accordance with the particular mode of administration selected. 

[00152] The compositions of the invention may also be administered via liposomes. 
30 Liposomes include emulsions, foams, micelles, insoluble monolayers, liquid crystals, 
phospholipid dispersions, lamellar layers and the like. In these preparations the composition of 
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the invention to be delivered is incorporated as part of a liposome, alone or in conjunction with a 
molecule which binds to a desired target, such as antibody, or with other therapeutic or 
immunogenic compositions. Thus, liposomes either filled or decorated with a desired 
composition of the invention of the invention can delivered systemically, or can be directed to a 
5 tissue of interest, where the liposomes then deliver the selected therapeutic/immunogenic peptide 
compositions. 

[00153] Liposomes for use in the invention are formed from standard vesicle-forming 
lipids, which generally include neutral and negatively charged phospholipids and a sterol, such 
as cholesterol. The selection of lipids is generally guided by consideration of, e.g., liposome 
1 0 size, acid lability and stability of the liposomes in the blood stream. A variety of methods are 
=~ available for preparing liposomes, as described in, e.g., Szoka et al. Ann. Rev. Biophys. Bioeng, 
fi 9:467 (1980), U.S. Patent Nos. 4,235,871, 4,501,728, 4,837,028, and 5,019,369, incorporated 
ill herein by reference. 

^ [00154] A liposome suspension containing a composition of the invention may be 

=fi 5 administered intravenously, locally, topically, etc. in a dose which varies according to, inter alia, 
s i the manner of administration, the composition of the invention being delivered, and the stage of 
rl the disease being treated. 

13 [00155] For solid compositions, conventional nontoxic solid carriers may be used 

p% which include, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium 
H 20 stearate, sodium saccharin, talcum, cellulose, glucose, sucrose, magnesium carbonate, and the 
like. For oral administration, a pharmaceutically acceptable nontoxic composition is formed by 
incorporating any of the normally employed excipients, such as those carriers previously listed, 
and generally 10-95% of active ingredient, that is, one or more compositions of the invention of 
the invention, and more preferably at a concentration of 25%-75%. 
25 [00156] For aerosol administration, the compositions of the invention are preferably 

supplied in finely divided form along with a surfactant and propellant. Typical percentages of 
compositions of the invention are 0.01%-20% by weight, preferably 1%-10%. The surfactant 
must, of course, be nontoxic, and preferably soluble in the propellant. Representative of such 
agents are the esters or partial esters of fatty acids containing from 6 to 22 carbon atoms, such as 
30 caproic, octanoic, lauric, palmitic, stearic, linoleic, linolenic, olesteric and oleic acids with an 
aliphatic polyhydric alcohol or its cyclic anhydride. Mixed esters, such as mixed or natural 
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glycerides may be employed. The surfactant may constitute 0.1%-20% by weight of the 
composition, preferably 0.25-5%. The balance of the composition is ordinarily propellant. A 
carrier can also be included, as desired, as with, e.g., lecithin for intranasal delivery. 

[00157] The compositions of the invention can additionally be delivered in a depot-type 
5 system, an encapsulated form, or an implant by techniques well-known in the art. Similarly, the 
compositions can be delivered via a pump to a tissue of interest. 

[00158] The compositions of the invention are typically administered to patients after 
the onset of symptoms, although treatment can also be prophylactic in some embodiments. 
Typically, treatment with direct administration of polypeptides is done daily, weekly, or 
1 0 monthly, for a period of time sufficient to reduce, prevent, or ameliorate symptoms. Treatment 
ri: with the nucleic acids of the invention is typically done at intervals of several months. In some 
•O embodiments, administration of the compositions of the invention is done in utero. 

Gene Therapy 

-%5 [00159] Gene therapy utilizing recombinant DNA technology to deliver nucleic acids 

: ~ encoding HCdc5 polypeptides, or antagonists or agonists of hCdc5 into patient cells or vectors 
ri which will supply the patient with gene product in vivo is also contemplated within the scope of 
O the present invention. 

ri [00160] Gene therapy techniques have the potential for limiting the exposure of a 

I= 20 subject to a gene product, such as a hCdc5 polypeptide, by targeting the expression of the 
therapeutic gene to a tissue of interest, such as skeletal muscle, myocardium, vascular 
endothelium or smooth muscle, or solid or circulating tumor cells. For example, WIPO Patent 
Application Publication No. WO 93/15609 discloses the delivery of interferon genes to vascular 
tissue by administration of such genes to areas of vessel wall injury using a catheter system. In 
25 another example, an adenoviral vector encoding a protein capable of enzymatically converting a 
prodrug, a "suicide gene", and a gene encoding a cytokine are administered directly into a solid 
tumor. In yet another example, an adenoviral vector encoding an angiogenic protein or peptide is 
administered non-surgically via catheter into a vessel supplying blood to a target tissue. (See, 
WIPO Patent Application Publication No. WO 96/26742 published 1 1/06/96 and corresponding 
30 U.S. Patent Nos. 5,792,453, issued 08/1 1/98 and 6,100,242, issued 08/08/00.) Other techniques 
for both surgical and non-surgical in vivo delivery of gene therapy products are likewise well 
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known to those of skill in the art and thus contemplated herein, such as, for example direct 
injection into a target tissue. 

[00161] Other methods of targeting therapeutic genes to tissues of interest include the 
three general categories of transductional targeting, positional targeting, and transcriptional 
5 targeting (for a review, see, e.g., Miller et al. FASEB J. 9:190-199 (1995)). Transductional 
targeting refers to the selective entry into specific cells, achieved primarily by selection of a 
receptor ligand. Positional targeting within the genome refers to integration into desirable loci, 
such as active regions of chromatin, or through homologous recombination with an endogenous 
nucleotide sequence such as a target gene. Transcriptional targeting refers to selective 
10 expression attained by the incorporation of transcriptional promoters with highly specific 

regulation of gene expression tailored to the cells of interest. 
=D [00162] Examples of tissue-specific promoters include a liver-specific promoter (Zou et 

m al. Endocrinology 138:1771-1774 (1997)); a small intestine-specific promoter (Olivera et al. J. 
>^ Biol. Chem. 271:31831-31838 (1996)); the promoter for creatine kinase, which has been used to 
fi 5 direct the expression of dystrophin cDNA expression in muscle and cardiac tissue (Cox et al. 
Ei "~ Nature 364:725-729 (1993)); and immunoglobulin heavy or light chain promoters for the 
r: expression of suicide genes in B cells (Maxwell et al. Cancer Res. 51:4299-4304 (1991)). An 
O endothelial cell-specific regulatory region has also been characterized (Jahroudi et al. Mol. Cell, 
n Biol. 14:999-1008 (1994)). Amphotrophic retroviral vectors have been constructed carrying a 
f i20 herpes simplex virus thymidine kinase gene under the control of either the albumin or alpha- 
fetoprotein promoters (Huber et al. Proc. Natl. Acad. Sci. U.S.A. 88:8039-8043 (1991)) to target 
cells of liver lineage and hepatoma cells, respectively. Such tissue specific promoters can be 
used in retroviral vectors (Hartzoglou et al. J. Biol. Chem. 265:17285-17293 (1990)) and 
adenovirus vectors (Friedman et al. Mol. Cell. Biol. 6:3791-3797 (1986)) and still retain their 
25 tissue specificity. 

[00163] Other elements aiding specificity of expression in a tissue of interest can 
include secretion leader sequences, enhancers, nuclear localization signals, endosmolytic 
peptides, etc. Preferably, these elements are derived from the tissue of interest to aid specificity. 
[00164] Viral vector systems useful in the practice of the instant invention include but 
30 are not limited to adenovirus, herpesvirus, adeno-associated virus, minute virus of mice (MVM), 
HIV, sindbis virus, and retroviruses such as Rous sarcoma virus, and MoMLV. Typically, the 
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nucleic acid encoding the therapeutic polypeptide of interest is inserted into such vectors to allow 
packaging of the nucleic acid, typically with accompanying viral DNA, infection of a sensitive 
host cell, and expression of the polypeptide of interest. 

[00165] In still other embodiments of the invention, nucleic acid encoding a therapeutic 
5 polypeptide of interest is conjugated to a cell receptor ligand for facilitated uptake (e.g., 
invagination of coated pits and internalization of the endosome) through a DNA linking moiety 
(Wu et al. J. Biol. Chem. 263:14621-14624 (1988); WO 92/06180). For example, the DNA 
constructs of the invention can be linked through a polylysine moiety to asialo-oromucoid, which 
is a ligand for the asialoglycoprotein receptor of hepatocytes. 
10 [00166] Similarly, viral envelopes used for packaging the recombinant constructs of the 

i«i invention can be modified by the addition of receptor ligands or antibodies specific for a receptor 
•0 to permit receptor-mediated endocytosis into specific cells (e.g., WO 93/20221, WO 93/14188; 
\f\ WO 94/06923). In some embodiments of the invention, the DNA constructs of the invention are 
;=j linked to viral proteins, such as adenovirus particles, to facilitate endocytosis (Curiel et al. Proc. 
--H5 Natl. Acad. Sci. U.S.A. 88:8850-8854 (1991)). In other embodiments, molecular conjugates of 
i~ the instant invention can include microtubule inhibitors (wo/9406922); synthetic peptides 
[ 3 mimicking influenza virus hemagglutinin (Plank et al. J. Biol. Chem. 269:12918-12924 (1994)); 
O and nuclear localization signals such as SV40 T antigen (W093/19768). 

\ ^ [00167] The nucleic acid can be introduced into the tissue of interest in vivo or ex vivo 

^20 by a variety of methods. In some embodiments of the invention, the nucleic acid is introduced to 

cells by such methods as microinjection, calcium phosphate precipitation, liposome fusion, or 

biolistics. In further embodiments, the nucleic acid is taken up directly by the tissue of interest. 

In other embodiments, nucleic acid is packaged into a viral vector system to facilitate 

introduction into cells. 

25 [00168] In some embodiments of the invention, the compositions of the invention are 

administered ex vivo to cells or tissues explanted from a patient, then returned to the patient. 
Examples of ex vivo administration of gene therapy constructs include Arteaga et al. Cancer 
Research 56(5) :1098-1 103 (1996); Nolta et al. Proc Natl. Acad. Sci. USA 93 (6):2414-9 (1996); 
Koc et al. Seminars in Oncology 23 (1) :46-65 (1996); Raper et al. Annals of Surgery 

30 223(2):1 16-26 (1996); Dalesandro et al. J. Thorac. Cardi. Surg. 11(2) :416-22 (1996); and 
Makarov et al. Proc. Natl. Acad. Sci. USA 93(l):402-6 (1996). 
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[00169] The following examples are provided to illustrate the invention but not to limit 
its scope. Other variants of the invention will be readily apparent to one of ordinary skill in the 
art and are encompassed by the appended claims. 



5 Experimental Examples 

Identification and cloning of the hCdc5 Gene 

Cloning, Sequencing, and Site-directed Mutagenesis 

[00170] A novel 1.45-kb cDNA, clone 67, was isolated during a yeast two-hybrid 
screen of a HeLa cDNA library in pGADGH (Harmon, G. et al., Genes Dev. 7:2378-2391 
10 (1993)) provided by R. Derynck, University of California San Francisco) with cytoplasmic 
m domains (amino acids 775-799, 1094-1115, and 1274-1512) of the human thrombin receptor 
(Vu, T. et al., Cell 64:1057-1068 (1991)) in the GAL4 binding domain vector, pASl-CYH 
in (Durfee, T. et al., Genes Devel. 7:555-569 (1993)) (provided by S. Elledge, Baylor College of 
Medicine). This was sequenced (Sanger, F. et al., Proc. Natl. Acad. Sci. U.S.A. 74:5463-5467 
'H5 1977)) and used to screen a HeLa cDNA library in Uni-ZAP XR (Stratagene) for full-length 
clones. The 1.45-kb cDNA insert from clone 67 was labeled using the ECL enhanced 
!°7 chemiluminescence system (Amersham Corp.) according to the protocol provided, and plaques 
O were screened using standard techniques (Sambrook, J. et al., Molecular Cloning: A Laboratory 
n Manual, 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989)) as modified 
ia 20 in the ECL protocol. Eleven Uni-ZAP XR clones were excised in vivo into SK phagemid 
according to the protocol from Stratagene. The 2.8-kb insert from phagemid clone 67.1 was 
sequenced (Sanger, F. et al., Proc. Natl. Acad. Sci. U.S.A. 74:5463-5467 (1977)), and identity 
with pGADGH clone 67 was established. Clone 67.1 was modified after ultimate codon 802 to 
add an epitope (DYKDDDK, SEQ ID NO: 12) recognized by monoclonal antibody M2 
25 (Kodak/IBI) using site-directed mutagenesis (Kunkel, T.A. et al., Methods Enzymol. 154:367- 
382 (1987). The epitope-tagged insert was then excised from SK at Notl/Apal restriction sites 
and subcloned into these same sites in pcDNA3 (Invitrogen) (Sambrook, J. et al., Molecular 
Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY (1989)). This clone, pcD67F, was used in all further studies, as described below. 
30 [00171] Related sequences were identified using the basic local alignment search tool 

(Altschul, S. et al., J. Mol. Biol. 215:403-410 (1990) and sequence data bases available from the 
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National Center for Biotechnology Information. Alignments and Pustell dot matrix homology 
analyses were performed using Mac Vector sequence analysis software (Oxford). 

Northern Analysis 

5 [00172] The human multiple tissue Northern blot (Clontech) was prehybridized in 750 

mM sodium chloride, 50 raM sodium phosphate, pH 7.4, 5 mM EDTA, 0.2% Ficoll, 0.2% 
polyvinylpyrrolidone, 0.2% bovine serum albumin, 50% deionized formamide, 2% sodium 
dodecyl sulfate, 100 ug/ml sheared salmon sperm DNA (Sigma) at 42°C for 4 h, then hybridized 
in the same solution containing 1.2 x 10 6 cpm/ml labeled probe at 42°C for 24 h. Following 
10 hybridization, the blot was rinsed three times in 300 mM sodium chloride, 30 mM sodium 
p citrate, pH 7, 0.05% sodium dodecyl sulfate at room temperature, then three times in the same 
:^ solution at room temperature for 10 min. then two times in 15 mM sodium chloride, 1.5 mM 
U1 sodium citrate, pH 7, 0.1% sodium dodecyl sulfate at 50°C for 20 min. The washed membrane 
n was exposed to x-ray film for 24 h with one intensifying screen at -70°C. The Notl/Apal 
%5 fragment of pcD67F or a human (3-actin cDNA control (Clontech) was labeled to a specific 
activity of 3 x 10 8 cpm/|ag using the Prime-It II random primer labeling kit (Stratagene) 
y> according to the manufacturer's instructions. The blot was stripped between hybridizations by 
washing two times in 0.5% sodium dodecyl sulfate at 100°C for 10 min. 

20 Cell Culture and Transfection 

[00173] COS-7 and CV-1 cell lines were maintained in DMEM H-16 medium with 3 

g/liter glucose (Life Technologies, Inc.) and 10% bovine calf serum (Life Technologies, Inc.). 

Transfections were performed using LipofectAMINE (Life Technologies, Inc.) according to the 

protocol provided by the manufacturer. Transiently transfected cells were allowed to recover in 
25 DMEM with serum for 12-18 h. then incubated an additional 18-24 h. in serum-free DMEM 

containing 0.1% bovine serum albumin. Serum-deprived cells then were stimulated with DMEM 

with or without 10% bovine calf serum or 0.1% bovine serum albumin. 

Western Analysis 

30 [00174] Cell monolayers were placed on ice and rinsed with cold phosphate-buffered 

saline with protease/phosphatase inhibitors (100 ng/ml phenylmethylsulfonyl fluoride, 2 |ig/ml 
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aprotinin, 1 ug/ml leupeptin, 1 ug/ml soybean trypsin inhibitor, 1 ug/ml pepstatin A, 10 ug/ml 
benzamidine, 20 uM okadaic acid, 200 uM sodium orthovanadate, 10 mM sodium 
pyrophosphate). Cells were then scraped into the rinsing buffer and pelleted at 1,500 x g at 4°C. 
Pellets were resuspended in an equal volume of 2 x gel loading buffer (100 mM Tris, pH 6.8, 4% 
5 sodium dodecyl sulfate, 20% glycerol, 200 mM dithiothreitol, 0.2% bromphenol blue) and 
sheared by passage through 21- and 27-gauge syringe needles. Samples were analyzed by 
electrophoresis in 8% polyacrylamide and transferred to nitrocellulose according to standard 
protocols (Sambrook, J. et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY (1989)) . 
1 0 [00175] Membranes were blocked in TNT (50 mM Tris, pH 7.4, 0.5% sodium chloride, 

p 0.05% Tween 20) with 5% non-fat dried milk at room temperature for 30 min., incubated in the 
?i same solution with 10 p.g/ml M2 antibody at 4°C for 18 h. washed in TNT, incubated with 
= horseradish peroxidase-conjugated goat anti-mouse antibody (Bio-Rad) at 1 :20,000 dilution in 
O TNT, 5% non-fat dried milk at room temperature for 30 min., and washed in TNT. Blots were 
U5 rinsed in 50 mM Tris, pH 7.4, 0.5% sodium chloride, then developed using ECL according to the 
manufacturer's instructions. 

*£i Immunofluorescence Microscopy 

O [00176] COS-7 or CV-1 cells were plated on sterile 4.8-cm 2 glass coverslips and 

' 20 transfected as described above. Following transfection and treatment with serum or albumin, 
cells were fixed at room temperature in 4% paraformaldehyde in phosphate-buffered saline. 
Coverslips were then rinsed with blocking solution (150 mM sodium acetate, pH 7, 0.1% non-fat 
dried milk in phosphate-buffered saline), permeabilized with 0.5% Triton X-100 in blocking 
solution, rinsed with washing solution (15 mM sodium acetate, pH 7, 0.1% non-fat dried milk in 
25 phosphate buffered saline), incubated with 10 ug/ml M2 antibody in washing solution, rinsed 
with washing solution, incubated with 2 p.g/ml fluorescein isothiocyanate-conjugated goat anti- 
mouse antibody (Life Technologies, Inc.) in washing solutions, and rinsed with phosphate- 
buffered saline, all at room temperature. Coverslips were mounted on slides using Slow Fade 
Light Anti-fade (Molecular Probes) and examined using a Nikon Microphot-FXA fluorescence 
30 microscope. 
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Labeling of Cells and Immunoprecipitation of hCdc5 

[00177] COS-7 cells were transiently transfected with pcD67F, serum-deprived, and 
labeled with [ 32 P] orthophosphoric acid (Dupont NEN) according to standard methods (Ausabel, 
F. et al., Short Protocols in Molecular Biology, John Wiley and Sons, New York, pp. 17.3-17.5 
5 (1995)). Following serum stimulation with phosphate-free DMEM (Life Technologies, Inc.) and 
10% dialyzed fetal calf serum (Life Technologies, Inc.), cells were placed on ice, scraped into 
cold phosphate-buffered saline with protease/phosphatase inhibitors, and pelleted at 1,500 x g at 
4°C. Cell pellets were resuspended in RIPA buffer (10 mM Tris, pH 7.4, 500 mM NaCl, 1 mM 
EDTA, 1% Triton X-100, 1% sodium deoxycholate, 0.5% sodium dodecyl sulfate) with 
10 protease/phosphatase inhibitors and sheared by passage through 21- and 27-gauge syringe 
needles. Cell lysates were extracted for 60 min. at 4°C and then centrifuged at 15,000 x g for 15 
i; JB min. at 4°C. Supernatants were precleared with protein A-Sepharose (Pharmacia Biotech Inc.) 
1/1 for 60 min. at 4°C and then incubated with 10 ug/ml M2 antibody for 3 h. at 4°C. Protein A- 
=4 Sepharose was added, and incubations were continued for 60 min. Immunoprecipitates were 
'H5 washed with RIPA buffer and then suspended in gel loading buffer and boiled prior to 
s, electrophoresis in 8% polyacrylamide. Dried gels were exposed to x-ray film with two 
r: intensifying screens for 4-72 h. at -70°C. 

O [00178] Densitometric analysis was accomplished by scanning representative 

pi autoradiographs into Photoshop (Adobe), then measuring signal densities using NIH Image 
- =i 20 (NIH). Signal densities were normalized against chemiluminescence signals obtained from 
Western blots of similarly treated, unlabeled samples. 

Results and Discussion 

[00179] The 1.45-kb partial cDNA identified during the yeast two-hybrid screen 
25 contained an open reading frame encoding a potential DNA binding domain. This cDNA was 
used to isolate a 2.85-kb cDNA from a HeLa cell cDNA library which contained a complete 
open reading frame encoding an 802-amino acid protein (Figure 1A (SEQ ID NO:l)). The 
protein sequence contained two tandem repeats of a helix-turn-helix DNA binding motif 
(Nelson, H.C. M., Curr. Opin. Genet. Dev. 5:180-189 (1995)), four consensus nuclear 
30 localization signals (Boulikas, T., Crit. Rev. Eukaryot. Gene Expr. 3:193-227 (1993)), a proline- 
rich, hydrophilic region similar to known activating domains (Mitchell, P.J. et al., Science 
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245:371-378 (1989); Tijan, R. et al., Cell 77:5-8 (1994)), and potential sites for phosphorylation 
by intracellular kinases (Figure 1 , A and B). 

[00180] A search for related cDNA sequences revealed significant homology with the 
cdc5 gene product in S. pombe (Ohi, R. et al., EMBO J. 13:471-483 (1994)). The two proteins 
5 were 75% identical over the 223 amino acids comprising the DNA binding and nuclear 
localization domains and 17% identical over the subsequent 535 amino acids (Figure 2A). 
Amino acids 4-56 and 57-107 of hCdc5 represent two tandem repeats of a helix-turn-helix motif 
similar to the DNA binding domain of Myb-related proteins (Figure 2B). This region was 83% 
identical to the analogous region (amino acids 2-103) of S. pombe Cdc5 (SEQ ID NO:3) (Ohi, R. 
10 et al., EMBO J. 13:471-483 (1994)) and 36-38% identical to the corresponding domains in the 
„ human Myb subfamilies (SEQ ID NO:4-6) (Majello, B. et al., Proc. Natl. Acad. Sci. U.S.A. 
?S 83:9636-9640 (1986); Nomura, N. et al., Nucl. Acids Res. 16:1 1075-1 1089 (1988)). Carboxyl to 
IJ1 this region was a hydrophilic stretch of 223 residues (amino acids 378-500, SEQ ID NO: 7), with 
,5 some homology to the activating domains of a- and b-Myb (SEQ ID NO: 10 and 9) (Majello, B. 

".xttir 

=11 5 et al., Proc. Natl. Acad. Sci. U.S.A. 83:9636-9640 (1986); Nomura, N. et al., Nucl. Acids Res. 
T 16:11075-11089 (1988) and S. pombe Cdc5 (SEQ ID NO: 8) (Ohi, R. et al., EMBO J. 13:471- 
p 483 (1994)) (Figure 2C). This analysis indicates that hCdc5 and S. pombe Cdc5 are related 
O proteins, with significant similarity to the Myb family. 

;S [00181] A search of expressed sequences revealed that homologous open reading 

^20 frames are present in M. musculus (GenBank Accession no. W82296), C. elegans (Wilson, R. et 
al., Nature 368:32-38 (1994)), and S. cerevisiae (GenBank Accession no. Z49809) (Figure 2A). 
The murine expressed sequence tag aligns with the carboxyl terminus of hCdc5 and is 97% 
identical over the 127 amino acids available, suggesting that this is almost certainly the murine 
homolog of hCdc5. The 528-amino acid polypeptide encoded by a C. elegans gene is 80% 
25 identical over the first 205 residues, including the DNA binding domain and nuclear localization 
signals of hCdc5, and 30% identical over the remaining 323 amino acids. An open reading 
frame in S. cerevisiae reveals 54% identity over the first 230 residues and 14% identity over the 
remaining 332 amino acids. The finding of open reading frames encoding proteins with similar 
domain arrangements and amino acid sequences in human, mouse, nematode, and budding and 
30 fission yeast indicates that hCdc5 is a member of a protein family that has been highly conserved 
throughout evolution. 
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[00182] Northern analysis at high stringency revealed a dominant band at 3.4 kb in all 
human tissues examined (Figure 3). In skeletal muscle, heart, pancreas, and placenta, a less 
prominent 8-9 kb band was observed. How this larger mRNA species related to the dominant 
3.4-kb species remains to be determined. Of note, the wide expression of hCdc5 in normal adult 
5 human tissues contrasts with c-Myb, which is expressed almost exclusively in hematopoietic 
cells (Luscher, B. et al., Genes Dev. 4:2235-2241 (1990); Thompson, M.A. et al., Bioassays 
17:341-350 (1995)). These findings suggest that hCdc5 may serve a more general function in 
transcriptional regulation. 

[00183] To determine its location in mammalian cells, an epitope-tagged version of 
10 hCdc5 was transiently expressed in CV-1 cells. Cells grown in standard culture medium with 
10% bovine calf serum demonstrated nuclear localization of hCdc5, consistent with a role in 
m transcriptional regulation. Remarkably, however, hCdc5 was found exclusively in the cytoplasm 
Ijl in transfected CV-1 cells deprived of serum. When these serum deprived cells were then 
stimulated with 10% bovine calf serum for 5, 15, or 60 min., hCdc5 was found solely in the 
=11 5 nucleus. This same phenomenon was observed in transiently transfected COS-7 cells. To the 
extent that the results obtained with epitope-tagged hCdc5 expressed in CV-1 or COS-7 cells 
r; accurately reflect the behavior of endogenous hCdc5, the rapid nuclear translocation of hCdc5 in 
Q response to serum stimulation suggests a role in relaying signals from the cell surface to the 
q nucleus. 

ps 20 [00184] Many transcription factors contain phosphorylation sites that regulate nuclear 

localization as well as DNA binding and transactivation (Vandromme, M. et al., Trends 
Biochem. Sci. 21:59-64 (1996)). hCdc5 contains 28 potential phosphorylation sites: 14 for 
recognition by casein kinase II (S/T-X-X-D/E), 9 for protein kinase C (S/T-X-R/K), 2 for protein 
kinase A (R/K-X-X-S/T), and 3 for MAP kinase (P-X-S/T-P or P-X-X-S/T-P) (see Figure 1). 
25 Western blot analysis of whole cell lysates from COS-7 cells expressing epitope-tagged hCdc5 
demonstrated a single, transfection-dependent band at about 105 kDa (Figure 4A). This is 
greater than the predicted size of 92.2 kDa, suggesting the possibility that hCdc5 either maintains 
a structure that slows its mobility on gel electrophoresis or carries post-translational 
modifications. 

30 [00185] To test the hypothesis that hCdc5 is a phosphoprotein, immunoprecipitates of 

32 P-labeled COS-7 cells expressing epitope-tagged hCdc5 were analyzed by sodium dodecyl 
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sulfate-polyacrylamide gel electrophoresis. Autoradiography revealed a transfection-dependent, 
32 P-labeled band of the appropriate molecular weight (Figure 4B). The intensity of this band 
doubled within 10 min. of serum stimulation, as quantitated by densitometry. Thus, although 
recombinant hCdc5 was phosphorylated in non-stimulated COS-7 cells, the degree of 
5 phosphorylation or the amount of phosphoprotein available for immunoprecipitation increased 
with serum stimulation. 

[00186] Precedent for phosphorylation-dependent translocation of transcription factors 
is well established (Vandromme, M. et al., Trends Biochem. Sci. 21:59-64 (1996)). Several 
potential phosphorylation sites flank the four consensus nuclear localization signals in hCdc5 
10 (Figure 1A). In addition, these nuclear localization signals and phosphorylation sites are 
; »=. conserved in S. pombe Cdc5. Whether phosphorylation regulates translocation, DNA binding, or 
'£1 activation for these putative transcription factors remains to be determined. 
in [00187] The relatedness of hCdc5 to S. pombe Cdc5 prompts speculation about its 

^ possible function. The S. pombe molecule has been implicated in the regulation of cell division, 
435 specifically at the G2/M transition (Nasmyth, K. et al., Mol. Gen. Genet. 182:119-124 (1981); 
T Ohi, R. et al., EMBO J. 13:471-483 (1994)). hCdc5 is almost certainly a serum-regulated 
P transcription factor. Taken together with its similarity to S. pombe Cdc5 and its ubiquitous 
O expression, this suggests a possible role for hCdc5 in cell cycle control. 

[00188] Similarities and differences between hCdc5 and c-Myb also are worthy of 
^20 comment. The DNA binding ability of the helix-turn-helix motif seen in hCdc5 has been well 
characterized (Nelson, H.C., Cur. Opin. Genet. Dev. 5:180-189 (1995)). In contrast with the 
DNA binding domain seen in Myb, both hCdc5 and S. pombe Cdc5 contain only two tandem 
repeats of the helix-turn-helix motif, whereas Myb family members possess three. Moreover, 
within this domain both hCdc5 and S. pombe Cdc5 bear a valine to leucine substitution at a 
25 position critical for DNA binding specificity (Ogata, K. et al., Nat. Struct. Biol. 3:187-187 
(1996); Carr, M.D. et al., Eur. J. Biochem 235:721-735 (1996)). hCdc5 and S. pombe Cdc5, 
therefore, may differ from Myb in their DNA binding properties. As noted above, the expression 
pattern of hCdc5 differs from that of the Myb family. These observations suggest that although 
hCdc5 and S. pombe Cdc5 share similarity with the Myb family, they are likely to have distinct 
30 biological roles. 
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Function of hCdc5 
Cell Culture 

[00189] All cell lines were obtained from the American Type Culture Collection. 
BALBc/3T3 and COS-7 cell lines were maintained in Dulbecco's Modified Eagle's media made 
5 10% bovine calf serum (Life Technologies). Jurkat cells were maintained in RPMI 1640 media 
made 10% feta bovine serum (Life Technologies). Cells were detached for passage and cell 
growth assays using 0.05% trypsin, 0.02% EDTA in 0.9% NaCl. Synchronization of 
BALBc/3T3 -derived cultures in GO was accomplished by washing cells with serum-free media 
and maintaining in media made 0.5% bovine calf serum for 48 hours. Release into Gl was 
10 accomplished by the addition of pre-warmed bovine calf serum to 10%. 

''■B Mammalian Expression Constructs 

in [00190] Plasmid 67F, encoding FLAG-tagged, full-length hCdc5 in pcDNA3 was 

,2 constructed as previously described (Bernstein, H.S. et al., J. Biol. Chem. 272:5833-5837 
415 (1997)). pUHD 15-1, which directs the expression of a fusion of the bacterial tetracycline 
repressor DNA binding domain with the H. simplex virion protein 16 activating domain, was 
obtained from Display Systems Biotechnology. pUH67F, encoding epitope-tagged hCdc5 3' to 
O seven tandem copies of the tet operator and a minimal CMV promoter, was constructed by 
!»t subcloning an Apal/Kspl fragment from pSK67F (Bernstein, H.S. et al., J. Biol. Chem. 
^0 272:5833-5837 (1997)) into the Xbal/Kspl sites of pUHD 10-1 (Display Systems Biotechnology) 
after filling in Apal and Xbal ends with T4 DNA polymerase. pNGFP-Nl, encoding green 
fluorescent protein with F64L and S65T mutations, was obtained from Clontech and used as a 
marker for transfection, (Iavarone, A. et al., Nature 387:417-422 (1997)). 

25 Transfection and Generation of Stable Cell Lines 

[00191] COS-7 and BALBc/3T3 cells were transfected using LipefectAMINE (Life 
Technologies) according to the manufacturer's instructions. Cell line 3T15.8 was generated by 
transfecting plasmid pUHD 15-1 into BALBc/3T3 cells and selecting clones in the presence of 
400 (ig/ml G418 (Geneticin; Life Technologies). Clones were then tested for their ability to 

30 induce expression from the tet operative-CMV promoter in transient transfection assays using a 
luciferase reporter plasmid, pUHC 13-3 (Display Systems Biotechnology). pUHC 13-3 was 
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transfected into G418-resistant clones in the presence or absence of 10 ug/ml tetracycline 
(Sigma). Cells were assayed for luciferase activity using the Luciferase Assay System 
(Promega) according to the manufacturer's protocol. Clone 3T15.8 was found to induce 
expression of luciferase activity in the absence of tetracycline with barely detectable activity in 
5 the presence of tetracycline and was chosen for use in subsequent experiments. This cell line 
was maintained in 400 |ig/ml G418 and 10 ug/ml tetracycline. 

[00192] To generate a stable cell line expressing an epitope-tagged hCdc5 (Bernstein, 
H.S. et al., J. Biol. Chem. 272:5833-5837 (1997)) under control of the tet operator, plasmid 
pUH67F was transfected into cell line 3T15.8 together with plasmid CA10 (obtained from D.O. 
10 Morgan, UCSF), which provided a marker for hygromycin resistance. Tetracycline was added to 
pi the culture media and clones were selected with 200 (ig/ml hygromycin (Boehringer Mannheim). 

Clones were screened for their ability to express hCdc5 in the absence of tetracycline by 
Ul immunoblot analysis as described below. Clone 3T1 5.8.22 showed increased expression 
ri detectable by immunoblot in the presence of tetracycline and was selected for use in the 
; !I5 experiments described. This cell line was maintained in 400 ug/ml G418, 200 u.g/ml 

hygromycin, and 10 ug/ml tetracycline. 
i2 [00193] To assess the effects of constitutive hCdc5 overexpression, tetracycline was 

:i removed from cultures by rinsing adherent cells with media without tetracycline, and re-feeding 
O with fresh media every 2 hours for 6 hours. Maximal expression of exogenous hCdc5 was 
= 20 reached by 36 hours after tetracycline removal (Figure 5), thus experiments were designed to 
allow cells 36-48 hours under tetracycline-free conditions before assaying effects of hCdc5 
overexpression. 

[00194] Transient transfection of Jurkat cells was accomplished by electroporation. 
Briefly, about 10 7 cells in media with serum were placed on ice and mixed with a total of 20 ug 
25 plasmid DNA. Electroporation was performed with a Gene Pulser (Bio-Rad) set at 250 mV, 960 
|iF. Cells were immediately inoculated into 10 ml media with serum and incubated at 37°C under 
5% CO2 for 24 hours, then expanded to 20 ml and incubated an additional 72 hours before 
analysis. 

30 Antibody Production and Immunoblot Analysis 
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[00195] A peptide representing the carboxyl terminus of hCdc5 (SEQ ID NO:l, amino 
acids 769-788) was synthesized at the Biomolecular Resource Center, University of California, 
San Francisco and conjugated to keyhole limpet hemocyanin using an ImJect Activated 
Immunogen Conjugation kit (Pierce) according to the manufacturer's instructions. 
5 Immunizations and fusions were performed essentially as described (Harlow, E. et al., 
Antibodies: A Laboratory Manual Cold Spring Harbor, New York 1988)). Ten days post fusion, 
hybridoma supernatants were screened for antibodies against the peptide by ELISA. Hyridomas 
producing antibodies of interest were subcloned and expanded. Subclones were tested in 
immunoblot assays using hybridoma supernatant preabsorbed with peptide as a negative control. 
10 Clone Bl was used for subsequent experiments. Rabbit polyclonal antisera raised against the 
KLH-conjugated peptide were produced by Animal Pharm Services (Healdsburg). Monoclonal 
antibody M2 directed against the FLAG epitope was obtained from Kodak. Polyclonal 
m antibodies cross-reactive against mouse and human cyclins E, A, and Bl were obtained from 

Santa Cruz Biotechnology. 
,|J5 [00196] Immunoblot analysis was performed essentially as described Bernstein, H.S. et 

iM al., J. Biol. Chem. 272:5833-5837 (1997)). Briefly, cell monolayers were scraped into cold 
P phosphate buffered saline and pelleted at 1500 x g at 4°C. Pellets were resuspended in an equal 
b volume of 2x gel loading buffer (100 mM Tris, pH 6.8, 4% sodium dodecyl sulfate, 20% 
% glycerol, 200 mM dithiothreitol, 0.2% bromophenol blue) and sheared by passage through 21- 
H>0 then 27-gauge syringe needles. Samples were analyzed by electrophoresis in 8% or 10% 
polyacrylamide and transferred to nitrocellulose according to standard protocols. Membranes 
were blocked in TNT (50 mM Tris, pH 7.4, 0.5% sodium chloride, 0.05% Tween-20) made 5% 
non-fat dried milk at room temperature for 30 min., and incubated in the same solution made 
either 10 fig/ml monoclonal antibody, 1:100 dilution with hyridoma supernatant, or 3 |ag/ml 
25 polyclonal antibody at 4°C for 1-3 hours. Blots were then washed in TNT, incubated with 
horseradish peroxidase (HRP)-conjugated goat anti-mouse antibody (Bio-Rad) at 1:10,000 
dilution or HRP-conjugated goat anti-rabbit antibody (Bio-Rad) at 1:5,000 dilution in TNT-5% 
non-fat dried milk at room temperature for 30 min. and washed in TNT. Blots were rinsed in 50 
mM Tris, pH 7.4, 0.5% sodium chloride, then developed using the ECL enhanced 
30 chemiluminescence system (Amersham) according to the manufacturer's instructions. 
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Densitometric analysis was accomplished by scanning representative ECL results into Photoshop 
(Adobe), then measuring signal densities using NIH Image (NIH). 

Immunofluorescence 

5 [00197] Immunofluorescence analysis was performed essentially as described 

(Bernstein, H.S. et al., J. Biol. Chem. 272:5833-5837 (1997)). BALBc/3T3 or 3T1 5.8.22 cells 
were plated on sterile 4.8 cm 2 glass coverslips in low serum and stimulated with 10% bovine calf 
serum for 10 min. Cells were then placed on ice and fixed at room temperature in 4% 
paraformaldehyde in phosphate-buffered saline. Cover slips were washed with quenching 
10 solution (150 mM sodium acetate, pH 7, 0.1% non-fat dried milk in phosphate-buffered saline), 
permeabilized with 0.5% triton X-100 in quenching solution, washed with washing solution (15 
=0 mM sodium acetate, pH 7, 0.1% non-fat dried milk in phosphate-buffered saline), incubated with 
i« either 10 ug/ml monoclonal antibody or 1:10 dilution of polyclonal antiserum in washing 
"~2 solution, washed with washing solution, incubated with 2 ug/ml fluorescein isothiocyanate- 
=Fl5 conjugated goat anti-mouse antibody (Life Technologies) or 5 ng/ml Texas red-conjugated goat 
7 anti-rabbit antibody (Molecular Probes) in washing solution, and washed with phosphate- 
: |J buffered saline, all at room temperature. Cover slips were mounted on slides using Slow Fade 
13 Anti-Fade (Molecular Probes) and examined using a Nikon Microphot-FXA fluorescence 
fl microscope. 

Cell Cycle Analysis 

[00198] For cell cycle distribution analysis of BALBc/3T3 cultures, cells were 
synchronized as described above, then stimulated with serum and analyzed about 10 6 cells at 
each timepoint were fixed in 95% ethanol, stained with propidium iodide (PI), and analyzed for 

25 DNA content using a Becton Dickinson FACScan with an argon ion laser at 488 nm and 250 
mW light output. PI emission signals were acquired using 630/22nm bandpass filter. All 
analyses of DNA content were performed on 20,000 collected events using CellQuest software 
(Becton Dickinson), except where indicated. Two-tailed P values were obtained using the 
Student t test for unpaired data. 

30 [00199] For cell cycle distribution analysis of cells overexpressing hCdc5, actively 

dividing or synchronized 3T15.8.22 or 3T15.8 cells were grown in the presence or absence of 
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tetracycline. Cells were analyzed at indicated timepoints following serum stimulation, or after 
72 hours of active division. Briefly, about 2 x 10 6 cells at each timepoint were stained with PI, to 
identify dead cells, and Hoechst 33342, which provides a measure of DNA content in living 
cells. Flow cytometry was performed using a Becton Dickinson FACStar Plus with dual argon 
5 ion lasers at 488 and 363 nm light output. PI signals were collected as above and Hoechst 
emission signals were acquired using a 470/1 Onm bandpass filter. 

[00200] For analysis of cell cycle distribution in transiently transfected Jurkat cultures, 
about 3 x 10 6 cells were stained with PI and Hoechst 33342. PI and Hoechst emission signals 
were acquired for 10 5 events using a flow cytometrytar Plus as above. Of these, about 20,000 
10 GFP-expressing cells were collected using a 5 3 0/3 Onm bandpass filter and analyzed. 

;f| Results And Discussion 

="j*J Overexpression of hCdc5 Accelerates G2/M Progression 

y [00201] We initially attempted constitutive overexpression of hCdc5 in a variety of cell 

■M5 types but were unable to generate stable lines, suggesting that constitutive expression at high 
?iU levels was incompatible with cell survival. We therefore generated a BALBc/3T3-based cell 
O line, designated 3T1 5.8.22, in which hCdc5 was expressed under control of a tetracycline- 
5 regulatable promoter (Gossen, M. et al., Proc. Natl. Acad. Sci. U.S.A. 89:5547-5551 (1992); 
2 Resnitzky, Mol. Cell Biol. 14:1669-1679 (1994)). Immunoblotting demonstrated that in the 
H20 absence of tetracycline, recombinant hCdc5 was expressed at levels about 2-fold above that of 
the endogenous protein (Figure 5). Like native hCdc5, epitope-tagged recombinant hCdc5 
translocated to the nucleus upon serum stimulation. 

[00202] In order to assess the effect of increased hCdc5 expression on cell cycle 
progression, 3T15.8.22 cells were synchronized in GO with low serum then released in Gl and 
25 measured for DNA content as a function of time (Figure 6, panels A-H). Induction of hCdc5 by 
removal of tetracycline caused accelerated progression through the cell cycle; cells returned to 
Gl by 18 hours after release from quiescence in the absence of tetracycline compared with 24 
hours in the presence of tetracycline. By contrast, the parent cell line 3T15.8, which expressed 
the tet repressor-VP16 fusion but not exogenous hCdc5, return to Gl by 24 hours without any 
30 tetracycline-dependent effect. This cell cycle acceleration was maintained over time (Figure 7). 
Over 72 hours, cells overexpressing hCdc5 exhibited an abbreviated cell cycle length of about 19 
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hours, while cells in which recombinant hCdc5 expression was repressed by tetracycline grew at 
a slower rate with a cycle length of about 28 hours. The cell cycle length in 3T15.8 cultures was 
about 26 hours, unaffected by tetracycline, and similar to that seen in 3T 15.8.22 cultures in the 
presence of tetracycline. 

[00203] To determine which phase of the cell cycle was shortened by overexpression of 
hCdc5, we analyzed asynchronously dividing 3T1 5.8.22 and 3T15.8 cultures grown in the 
presence and absence of tetracycline (Figure 8). In 3T15.8.22 cultures, a highly significant 
tetracycline-dependent decrease in the calculated length of G2/M was noted (1.9 h. vs. 7.4 h; 
pO.OOOl). A smaller effect in S phase was also seen (2.2 h. vs. 5 h.; p<0.05). Control 3T15.8 
cultures showed no significant tetracycline-dependent effect. Thus, hCdc5 overexpression 
significantly accelerates G2/M. 

Overexpression of hCdc5 Overcomes a G2/M Size Checkpoint 

[00204] In fission yeast, cells must reach a size threshold in order to enter mitosis 
(Nurse, P., Nature 256:547-551 (1975); Russell, P. et al., Cell 49:559-567 (1987)). Since 
mammalian cells overexpressing hCdc5 demonstrated an appreciable increase in growth rate and 
acceleration through G2/M, we asked whether overexpression enabled cells to bypass size 
requirements for G2 progression and mitotic entry. 3T 15.8.22 cultures were synchronized in GO 
by serum deprivation in the presence or absence of tetracycline then released into Gl and 
measured for DNA content and cell size by flow cytometry. In the absence of tetracycline, 
3T1 5.8.22 cultures showed a decrease in average cell size as they completed a division cycle and 
entered GO. Moreover, unlike the parent cell line or 3T1 5.8.22 cultures in the presence of 
tetracycline, they failed to exhibit the expected increase in size as they approached mitosis 
(Figure 9). In addition, asynchronously dividing 3T 15.8.22 cultures maintained this smaller 
average cell size in the absence of tetracycline over time. This suggested that overexpression of 
hCdc5 allowed cells to proceed through G2, bypassing the normal size checkpoint for mitotic 
entry. 

Expression of Mutant hCdc5 Delays Mitotic Entry 

[00205] To further examine the role of hCdc5 in G2 progression, we designed mutants 
which were predicted to inhibit the cellular effects of hCdc5 in a dominant negative manner. 
One mutant, designated A675, contained the DNA binding and nuclear localization domains of 
hCdc5 but lacked its putative transcriptional activation domain (Figure 10). This mutant should 
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compete with wild-type hCdc5 in binding to its as yet unidentified target promoters but should 
fail to activate them. The human T cell Leukemia line (Jurkat) was co-transfected with vector 
containing either wild-type hCdc5, mutant A675, or vector alone and with a plasmid directing the 
constitutive expression of green fluorescent protein (GFP) as a marker for transfection (Iavarone, 
5 A. et al., Nature 387:41 7-422 (1 997)). Cells transfected with the A675 DNA expressed a protein 
of the expected molecular weight by immunoblot, and immunostaining confirmed that the 
expressed protein localized to the nucleus. GFP-positive cells in asynchronously dividing 
cultures were analyzed for cell cycle distribution by flow cytometry (Figure 11). Cells 
expressing A675 mutant accumulated in G2/M (Figures 4B and 5), while cells transfected with 
10 the empty vector had a phase distribution similar to untransfected cells. Cultures transfected 
with wild-type hCdc5 distributed toward Gl , as would be predicted from experiments with stable 
lines described above. Co-expression of wild-type hCdc5 with A675 titrated the mutant's effects 
Ijft of G2/M accumulation (Figure 12), consistent with the mutant acting specifically in the hCdc5 
pathway. 

4\s [00206] We next used mutant A675 to ask whether cells in which hCdc5's effects were 

i\ inhibited were delayed in G2 or in mitosis. Jurkat cultures co-transfected as above were 
^1 examined for chromosomal condensation at various times after exposure to nocodazole, which 

causes cell cycle arrest prior to metaphase in these cells (Krek, W. et al., Methods Enzymol. 
Q 254:114-124 (1995); Kung, A.L. et al., Proc. Natl. Acad. Sci. U.S.A. 87:9553-9557 (1990)). 
'°20 GFP-expressing cells transfected with A675 mutant exhibited a significantly slower rise in the 

fraction of cells in prophase over time compared with empty vector transfectants (Figure 13). 

Cells expressing wild-type hCdc5 displayed a slightly faster rise in prophase index compared 

with empty vector, and a significantly accelerated increase compared with cells expressing the 

mutant. Since nocodazole disrupts mitosis shortly after chromosomal condensation, these results 
25 indicate that hCdc5 acts at least in part during interphase. Taken together with the flow 

cytometry data, these observations strongly indicate that hCdc5 regulates G2 progression and 

mitotic entry. 

hCdc5 Expression is Not Cell Cycle-Dependent 

[00207] Many proteins controlling cell cycle progression are regulated at the level of 
30 expression and degradation (Morgan, D.O., Nature 374:131-134 (1995)). We therefore asked 
whether expression of hCdc5 was cell cycle-dependent. BALBc/3T3 cells were synchronized in 



500471 V2/PA 
@q5z02!.DOC 



56. 



GO by serum depletion, then released into Gl. Whole cell ly sates from cultures harvested at 
various times after release from quiescence were analyzed by immunoblotting for hCdc5 or 
specific cyclins (Figure 14 A). Cell cycle synchronization was confirmed by flow cytometry and 
analysis of DNA content (Figure 14B). hCdc5 protein levels were low in quiescent cells, but 
5 increased rapidly upon serum stimulation and remained constant throughout the cell cycle. 
Immunolocalization studies showed that hCdc5 moved rapidly from the cytoplasm to the nucleus 
upon serum stimulation of quiescent cells, but in dividing cells hCdc5 remained in the nucleus 
throughout the cycle. Thus, hCdc5 is regulated at the level of subcellular location and protein 
expression in cells entering the division cycle, but not in actively dividing cells 
10 [00208] The results described above indicates hCdc5 might be regulated in two distinct 

ways. In quiescent cells, the rapid translocation of hCdc5 from cytoplasm to nucleus and 
;Q upregulation of its expression upon addition of serum indicates regulation by extracellular 
signals. Available data do not as yet reveal a role for hCdc5 in GO or Gl, thus the functional 
%H result of hCdc5's early upregulation and nuclear localization is unclear. Without being limited to 
l55 any one theory, hCdc5's movement to the nucleus might serve to promote its own expression in 
;y preparation for later cell cycle events. Such positive autoregulation has been described for c- 
Q Myc (Leone, G. et al., Nature 387:422-426 (1997); Zornig, M. et al., Curr. Biol. 6:1553-1556 
5=1 91996)). Alternatively, localization and expression of hCdc5 might be regulated independently. 
?aI Regardless of how hCdc5 comes to be expressed and localized in the nucleus of dividing cells, 
} MO the data described above indicates that hCdc5 activity is not regulated at the level of expression 
or localization in actively dividing cells. 

[00209] We previously demonstrated that hCdc5 becomes hyperphosphorylated in 
serum-stimulated COS-7 cells (Bernstein, H.S. et al., J, Biol. Chem. 272:5833-5837 (1997)). 
Moreover, a X. laevis homolog of hCdc5 recently was identified as an in vitro substrate of Cdc2 
25 kinase (Stukenberg, P.T. et al., Curr. Biol. 7:338-348 (1997)). The similarity between the weel 
phenotype in S. pombe and that of mammalian cells overexpressing hCdc5 is tantalizing in this 
context. These observations indicate that hCdc5 may be a substrate for and transcriptional 
effector of Cdc2 in mammalian cells, and predict its negative regulation by Weel and positive 
regulation by Cdc25B. 

30 [00210] hCdc5, S. pombe Cdc5, and other Cdc5-like proteins are putative transcription 

factors by virtue of their primary structures (Bernstein, H.S. et al., J. Biol. Chem. 272:5833- 
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5837 (1997); Hirayama, T. et al., Proc. Natl. Acad. Sci. U.S.A. 93:13371-13376 (1996); Ohi, R. 
et al., EMBO J. 13:471-483 (1994); Stukenberg, P.T. et al., Curr. Biol. 7:338-348 (1997); 
Wilson, R. et al., Nature 368:32-38 (1994)). The genes that they regulate are unknown. As 
discussed above, it is possible that they are transcriptional effectors for Cdc2 kinase and regulate 
5 expression of genes necessary for G2 progression and mitotic entry. It also is possible that they 
act upstream of Cdc2, promoting its activation by regulating the expression of kinases and 
phosphatases that control Cdc2 (see above). 

[00211] Thus, based on its primary structure, hCdc5 is almost certainly a DNA-binding 
protein and very likely regulates transcription (Bernstein, H.S. et al, J. Biol. Chem. 272:5833- 
10 5837 (1997)). Overexpression of hCdc5 shortened G2 in mammalian cells and decreased cell 
size, presumably by bypassing a G2/M size checkpoint. A dominant negative hCdc5 mutant 
% delayed mitotic entry. These findings strongly indicate that hCdc5 is a functional homolog of S. 
* :j pombe Cdc5 and as such. These studies also confirm hCdc5, as the first described DNA binding 
%j protein which regulates mitotic entry in mammalian cells. 

■0 Identification of a hCdc5 DNA binding site 
p Materials and strains. 

% [00212] Plasmids pBM2389 and pBM2463, and S, cerevisiae strain YM4271, were 

ffl generously provided by Mark Johnston (Washington University School of Medicine). 
y20 Polyclonal antisera to hCdc5 were generated as previously described (Bernstein, H. S., and 
Coughlin, S. R. (1998) Journal of Biological Chemistry 273: 4666-71). 
Expression and purification of recombinant hCdc5 . 

[00213] The DNA-binding domain of the hCdc5 protein was expressed as a His 6 -fusion 
protein in E. coli by subcloning cDNA into pET29c(+) (Novagen) under control of the T7 
25 promoter//ac operator, and inducing with IPTG over time. His 6 -tagged protein was purified from 
bacterial lysates by nickel-affinity chromatography (See, Van Dyke et al. Gene (1992) 111:99- 
104), bacterial protein was isolated under native conditions, resolved by SDS-PAGE, and stained 
with Coomassie R-250. 

[00214] In order to express the DNA-binding domain of hCdc5, unique EcoRV and 
30 Sail sites were inserted into pSK67 (Bernstein, H. S., and Coughlin, S. R. (1997) Journal of 
Biological Chemistry 272(9), 5833-7) using site-directed mutagenesis (Kunkel, T. A., Roberts, J. 
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D., and Zakour, R. A. (1987) Meth Enzymol 154, 367-382.). The 1500 bp cDNA fragment 
encoding 500 amino acids from the amino terminus of hCdc5 was then subcloned into 
EcoRV/Sall sites in pET29c(+) (Novagen). The resulting plasmid was transformed into E. coli 
strain BL21 (DE3), and individual colonies were inoculated into 500 ml liquid LB medium made 
5 50 jig/ml kanamycin. Cultures were grown at 37°C to OD600 0.2-0.6. Cultures were induced 
with isopropyl p-D-thiogalactopyranoside (IPTG) at final concentration 1 raM for 2 hours. 
Bacterial pellets were collected by centrifugation and lysed in 50 mM NaH 2 P0 4 , pH 8.0, 300 
mM NaCl, 10 mM imidazole, 1 mg/ml lyso2yme at 4°C for 1 hour followed by sonication. The 
lysate supernatant then was added to 0.5 ml 50% Ni-NTA resin (Qiagen) and mixed gently at 
10 room temperature for 60 min. The resin was washed with 50 mM NaH 2 P0 4 , pH 8.0, 300 mM 
NaCl, 20 mM imidazole, and the His 6 -tagged protein eluted with 50 mM NaH 2 P0 2 , pH 8.0, 300 
iQ mM NaCl, 70 mM imidazole. The protein then was dialyzed in 10 mM Tris, pH 7.9, 60 mM 
in KC1, 4 mM MgCl 2 at 4°C overnight. Dialysed protein was analyzed by SDS-polyacrylamide gel 
y electrophoresis, and Immunoblot using monoclonal anti-His antibody (Invitrogen) according to 
s |i 5 methods described in Bernstein and Coughlin (1997). Chemical cross-linking was accomplished 
?l " by incubating 100 ng dialyzed protein in 75 mM sodium phosphate buffer, pH 7.0, with or 
O without 0.001% glutaraldehyde, according to published methods (Ransone, L. J. (1995) Meth 
q Enzymol 254: 491-7). Cross-linked products and untreated controls were evaluated using SDS- 
IJ; polyacrylamide gel electrophoresis and immunblot analysis with monoclonal anti-His antibody 
l»€0 (Invitrogen). 

Oligonucleotide selection . 

[00215] Oligonucleotide selection was performed with a DNA fragment that contained 
a 15 bp random sequence flanked on either side by 20 bp of nonrandom sequence (5'- 
CGCTCGAGGGATCCGAATTC(Ni 5 )TCTAGAAAGCTTGTCGACGC-3' SEQ ID NO: 17). 

25 The single-stranded oligonucleotide was made double-stranded with Taq DNA polymerase and a 
primer complementary to the 3' flanking sequence (5'-GCGTCGACAAGCTTTCTAGA-3' 
SEQ ID NO: 18). After extraction with phenol/chloroform/isoamyl alcohol and precipitation 
with glycogen/ethanol, 60 picomoles of the double-stranded oligonucleotide pool were incubated 
with 0.5 ug purified His 6 -hCdc5 in binding buffer (10 mM HEPES, pH 7.8, 75 mM KC1, 2.5 mM 

30 MgCl 2 , 3% Ficoll, 40 ug/ml poly(dl-dC), in the presence of 1 ug/ml phenylmethylsulfonyl 
fluoride, 0.4 ng/ml aprotinin, 1 ng/ml leupeptin, 0.4 ng/ml soybean trypsin inhibitor, 0.4 ng/ml 
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pepstatin A, 10 ng/ml benzamidine) at room temperature for 30 min. The binding solution then 
was incubated with 50% Ni-NTA agarose at room temperature for 10 min. After washing the 
resin extensively with binding buffer, the DNA-protein complexes were eluted with binding 
buffer made 250 mM imidazole. 
5 [00216] Bound oligonucleotides were amplified with Taq polymerase over 15-20 

cycles by annealing to flanking primers (5 '-GCGTCGACAAGCTTTCTAGA-3 ' SEQ ID NO: 18 
and 5'- CGCTCGAGGGATCCGAATTC-3 ' SEQ ID NO: 19) at 55°C for 20 sec and extending 
at 72°C for 10 sec. Subsequent rounds of binding were accomplished as above with 0.1 \xg 
recombinant protein and 1/1 0 th of the unpurified PCR product. After 3-8 cycles of binding, 
10 selection, and amplification, PCR products were cloned into a TA vector (Invitrogen) and 
sequenced. The DNA sequences of 5 products of the third round, five products of the sixth 
j'« round and five products of the eighth round are shown in Figure 15. The five products of the 

eighth round all have the same sequence. This is the consensus hCdc5 binding sequence. 
"■■■4 Preparation of cell extracts and electrophoretic mobility shift assay . 

''%5 [00217] HeLa cell extracts were prepared from subconfluent cultures by douncing cell 

•O pellets on ice in 10 mM Tris-Cl, pH 7.9, 1 mM EDTA, 5 mM DTT, then adding 4 volumes of 50 
ri mM Tris-Cl, pH 7.9, 10 mM MgCl 2 , 25% sucrose, 50% glycerol, 1 volume of saturated 
■» (NFLO2SO4, and stirring at 4°C for 20 min. All subsequent steps were performed at 4°C. After 
tu centrifugation at 1 00,000 x g for 90 min, supernatants were transferred to fresh tubes, to which 
y£0 0.3 g (NH4) 2 S04 was added per ml supernatant. NaOH then was added to neutralize the solution. 
Extracted proteins were collected by centrifugation at 10,000 x g for 30 min, and resuspended in 
20 mM HEPES, pH 7.9, 50 mM KC1, 0.1 mM EDTA, 1 mM DTT, 20% glycerol. These then 
were dialyzed against 20 mM HEPES, pH 7.9, 50 mM KC1, 0.1 mM EDTA, 1 mM DTT, 20% 
glycerol. 

25 [00218] DNA-protein binding was accomplished by incubating 0.8 pmoles of 

recombinant protein or 1 ug cell extract with 0.01 pmole of double-stranded, oligonucleotide 
(labelled with y- 32 P to a specific activity of 10 6 cpm/pmole) in binding buffer at room 
temperature for 30 min. After separation of bound and free DNA by 0.5% non-denaturing 
acrylamide gel electrophoresis, DNA binding activity was quantitated with a Storm 860 

30 Phosphor/Fluor Imager and ImageQuant software (Molecular Dynamics) and calculated as a 
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percentage of DNA bound divided by total DNA (bound/bound+free). Each DNA binding 

experiment was repeated at least three times. 

[00219] Equilibrium dissociation constants (KJ) were determined by incubating a 

constant amount of labeled DNA (1 nM) with increasing amounts of protein (2.5-40 nM) at room 
5 temperature for 30 minutes. After binding activity was quantitated as described above, the Kj 

was calculated with the equation A>=[D][P]/[DP], where [D] is the concentration of free DNA, 

[P] is the concentration of free protein, and [DP] is the concentration of the DNA-protein 

complex. The DNA concentration was limiting relative to the protein concentration to allow the 

approximation [DP]s[DP to tai]- 
10 Determination of the specificity and affinity of the synthetic consensus binding site 

[00220] Labeled consensus sequence identified by selection and amplification was 
S incubated with recombinant His6-hCdc5 in the presence or absence of unlabelled competitors. 
~"± The DNA-protein solution was then separated on a non-denaturing polyacrylamide gel. 
'■4 Increasing amounts of specific competitor (unlabelled consensus sequence at 1 0-900-fold molar 
gl 5 excess) drastically reduced the amount of labelled consensus sequence bound to the His6-hCdc5 
y protein as determined by a gel shift assay, while non-specific competitor (an unlabelled 
□ oligonucleotide of equivalent length and base composition at 30-1000-fold molar excess) did not. 
Z This demonstrated that the DNA-protein interaction was specific. The binding affinity of the 
5 consensus sequence for the hCdc5 protein was determined. Labeled consensus sequence was 
J20 incubated with increasing amounts of recombinant His6-hCdc5. The ratio of bound to free DNA, 

as quantitated by phosphorimage analysis as described above, determined an equilibrium 

Q 

dissociation constant (K</) of approximately 10" molar. 
Scanning mutagenesis of the consensus sequence . 

[00221] Labeled oligonucleotides bearing substitutions in each of the bases comprising 
25 the consensus sequence were incubated with recombinant His6-hCdc5 and gel shift assays were 
carried out as described above to determine the binding affinities for the mutated sequences. 
Removal or substitution of G in the first position, or substitution of C in the seventh position 
dramatically decreased affinity for hCdc5, suggesting that these are important but not critical 
residues. Substitutions at any other position abolished DNA-protein binding implicating these 
30 residues as critical for the interaction. The results are shown in Table 1 . 
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TABLE 1 . Scanning Mutagenesis of the Synthetic Consensus Binding Sequence 



Sequence of Oligo 


Mutation Site 


Fraction of Oligo Gel-shifted in Presence 
of His6-hCdc5 Compared to Control 


GATTTAACATAA 
(SEQIDNO: 13) 


None (control) 


1.0 


ATTTAACATAA 
(SEQ ID NO: 20) 


AG1 


0.5 


TATTTAACATAA 
(SEQ ID NO:21) 


Al 


0.5 


GCTTTAACATAA 
(SEQ ID NO:22) 


A2 


0.3 


GAGTTAACATAA 
(SEQ ID NO:23) 


A3 


0.2 


GATGTAACATAA 
(SEQ ID NO:24) 


A4 


0.2 


GATTGAACATAA 
(SEQ IDNO:25) 


A5 


0.4 


GATTTCACATAA 
(SEQ IDNO:26) 


A6 


0.1 


GATTTACCATAA 
(SEQ IDNO:27) 


A7 


0.7 


GATTTAATATAA 
(SEQ IDNO:28) 


A8 


0.2 


GATTTAACCTAA 
(SEQ ID NO:29) 


A9 


0.2 


GATTTAACAGAA 
(SEQ ID NO:30) 


A10 


0.2 


GATTTAACATCA 
(SEQ ID NO:31) 


All 


0.2 


GATTTAACATAC 
(SEQ ID NO:32) 


A12 


0.2 



1 The bolded bases are the sites of the mutation. 
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Transcriptional Reporter Assay 

[00222] Plasmid p5xCdc5 was constructed by cloning five, tandem copies of the 12 bp, 
hCdc5 consensus binding sequence in place of the GAL4 binding element in pFR-Luc by PCR 
using primers 5 ' -C AAGCTTGC ATGCCTGC AGGTGATTTAAC ATAAGATTTAAC ATAAG A 
5 TTTAACATAAGATTTAACATAAGATTTAACATAAACTCTAGAG-3' (SEQ ID NO: 46) 
and 5 '-CGTGTAC ATCG ACTGAAATCCC-3 ' (SEQ ID NO: 47). p3xG^Z4-3xCdc5 was 
constructed by cloning three copies of the GAL4 binding site in tandem with three copies of the 
hCdc5 binding site in place of the GAL4 binding site of pFR-Luc using primers 5'-CAAGCTTG 
CATGCCTGCAGGTCGGAGTACTGTCCTCCGCCGGAGTACTGTCCTCCGCCGGAGTAC 
1 0 TGTCCTCCGCGATTT AAC ATAAGATTTAACATAAG ATTTAAC ATAAACTCTAGAG-3 ' 
(SEQ ID NO: 48) and 5'-CGTGTA C ATCGACTGAAATCCC-3 ' (SEQ ID NO: 47). pFR-Luc 
1% (Stratagene) contains the entire coding region of Photinus pyralis luciferase downstream of 5 
^ tandem repeats of the GAL4 binding element and a basic promoter (TAT AT A). 
Cj [00223] HeLa cells were transiently transfected using LipofectAMINE Plus (GIBCO) 

according to the manufacturer's instructions, and luciferase activity was assayed in whole cell 
m lysates using the Dual-Luciferase Reporter Assay System (Promega). pM3-VP16 
ri (CLONTECH), encoding a GAL4-VP16 fusion, was used as a positive control. pRL-TK 
(Promega), encoding Renilla reniformis luciferase downstream of a Herpes TK promoter, was 
fx] included in each transfection for normalization of transfection efficiency. 

jj0 [00224] To test whether interaction between hCdc5 and its binding site could activate 

transcription, we transfected HeLa cells (known to express hCdc5) with a plasmid (p5xCdc5) 
containing the hCdc5 binding site upstream of a luciferase reporter. Luciferase activity was 
increased 28-fold in HeLa cells transfected with the plasmid containing the hCdc5 binding site, 
while activity was not detected in cells transfected with the same plasmid in which the binding 

25 site for GAL4 was substituted for the hCdc5 binding site (pFR-Luc). To determine whether 
binding by endogenous hCdcS was responsible for the observed reporter activity, we co- 
expressed the dominant negative hCdc5 mutant, A675, containing only the amino terminal DNA 
binding domain. This mutant is transcriptionally inactive, and delays G2/M in transfected cells. 
In HeLa cells, the A675 mutant abrogated reporter activity, presumably by competing with 

30 endogenous hCdc5 for binding to the consensus site, and blocking activation of the reporter. 
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These data demonstrate that specific interaction of endogenous hCdc5 with its selected binding 
sequence is capable of activating transcription. 
Binding site selection in yeast . 

[00225] The method for selection of human genomic binding sites was adapted from a 
5 similar system used to define the binding specificity of NGFI-B (Wilson, T. E., Fahrner, T. J., 
Johnston, M., and Milbrandt, J. (1991) Science 252:1296-300). The library or UAS (upstream 
activating sequence)-reporter plasmid was constructed by inserting Sa«3AI-digested, human 
genomic DNA into the unique BamWl site of pBM2389 (Liu, J., Wilson, T., Milbrandt, J., and 
Johnston, M. (1993) Methods: A Companion to Methods in Enzymology, 5: 125-137), upstream 
10 of an inactive GAL1 promoter. The activator plasmid was constructed by cloning sequence 
encoding the first 120 amino acids of hCdc5 into unique NotllXhol sites of pBM2463 (Liu et al. 
% 1993, supra), placing it in-frame between the lex A DNA binding domain and GAL4 activating 
"~~± domain. 

s.\ [00226] Yeast were manipulated according to standard protocols (Guthrie, C, and Fink, 

y 5 G. (eds) (1991) Guide to Yeast Molecular Genetics and Molecular Biology Vol. 194. Methods in 
% ti Enzymology, Academic Press, Inc., San Diego). UAS and activator plasmids were co- 
r! transformed into S. cerevisiae strain YM4271 (MATa ura3-52 his3-A200 ade2-101 lys 2-801 
P trp 1-901 gal4-A512 gal80-A538 ade5::hisG) (Liu et al. 1993, supra), to allow for red-white 
^ selection with ADE5, and counter-selection with 5-FOA. Non-sectoring, His + colonies were 
\J2Q patched to media containing 0.1% 5-FOA, and library plasmids that allowed activator-dependent 
growth in His" media, as confirmed by growth failure in 5-FOA, were selected for sequencing. 
Results and Discussion 

The DNA binding domain of hCdc5 binds specifically and with high affinity to a 12 base 
pair, double- stranded DNA sequence . 
25 [00227] To determine whether hCdc5 interacts with specific consensus sequences 

through its consensus DNA binding domain, we performed selection and amplification from a 
pool of random oligonucleotides (Blackwell, T. K. (1995) Meth Enzymol 254, 604-18). We first 
expressed and purified the amino terminus of hCdc5 (amino acids 1-500) in E. coli, then used 
this to select a preferential binding site from a degenerate pool of 1 5 bp oligonucleotides flanked 
30 by conserved sequences to facilitate amplification and cloning. After 8 cycles, a consensus 12 
bp sequence, GATTTAACATAA (SEQ ID NO: 13), was identified (Figure 15). 
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[00228] We then used an electrophoretic mobility shift assay to evaluate the DNA- 
protein interaction. Competition with unlabeled, consensus oligonucleotide demonstrated that 
the DNA binding domain of hCdc5 binds the double-stranded target sequence specifically. 
Binding did not occur with single-stranded sense or antisense oligomer. Titration with 
5 increasing amounts of purified, recombinant protein enabled us to calculate an equilibrium 
dissociation constant of ~10" 8 molar. This value is comparable to other helix-turn-helix DNA 
binding domains, such as that published for c-Myb (Krieg, J., Oelgeschlager, M, Janknecht, R., 
and Luscher, B. (1995) Oncogene 10: 2221-8). These experiments demonstrated for the first time 
that hCdc5 can bind with high affinity to a specific, double-stranded DNA sequence. It also 
10 confirms that the binding specificity for Cdc5 in higher eukaryotes is markedly distinct from 
even the extended consensus sequences reported for c-Myb (Weston, K. (1992) Nucleic Acids 
% Research 20: 3043-9; Howe, K. M., and Watson, R. J. (1991) Nucleic Acids Research 19:3913- 

'as? 

H 9). 

I PS 

u ; 

\i The hCdc5 target sequence contains essential palindromic and helix-turn-helix binding 

'-^.5 motifs . 

45 [00229] To determine which nucleotides within the consensus sequence were necessary 

ri for protein binding, we next performed scanning mutagenesis of the sequence. This revealed that 
the core "ANCA" motif, commonly found in the binding site for other Myb-related, helix-turn- 
ffj helix DNA binding proteins (Sala, A., Saitta, B., De Luca, P., Cervellera, M. N., Casella, I., 
j jo Lewis, R. E., Watson, R., and Peschle, C. (1999) Oncogene 18(6), 1333-9; Mcintosh, P. B., 
Frenkiel, T. A., Wollborn, U., McCormick, J. E., Klempnauer, K. H., Feeney, J., and Carr, M. D. 
(1998) Biochemistry 37(27), 9619-29; Suzuki, A., Wu, C. Y., Washida, H., and Takaiwa, F. 
(1998) Plant and Cell Physiology 39(5), 555-9; Oda, M., Furukawa, K., Ogata, K., Sarai, A., and 
Nakamura, H. (1998) Journal of Molecular Biology 276(3), 571-90; Ording, E., Bergholtz, S., 
25 Brendeford, E. M., Jamin, N., and Gabrielsen, O. S. (1996) Oncogene 13(5), 1043-51) was 
essential for binding. It also demonstrated that the flanking palindrome, TTA/TAA, increased 
binding affinity. To further investigate whether DNA binding was specific for the helix-turn- 
helix motif, as suggested by the "ANCA" core sequence, we expressed a truncation of the hCdc5 
amino terminus (amino acids 1-120), containing only the helix-turn-helix domain, as well as a 
30 Cdc5 mutant in which the helix-turn-helix motif was disrupted by W->G substitutions at 
positions 31, 53, and 82 in the human peptide sequence. These substitutions replace the core 
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tryptophan residues essential to the helix-turn-helix structure, and correspond to the W33G 
W52G W84G cefl mutant previously reported as unable to rescue growth of a cefl-Al::HIS3 
mutant (Ohi et al., 1998). While the helix-turn-helix domain alone bound the target sequence 
with the same specificity and affinity as the longer, amino-terminal peptide (amino acids 1-500), 
the W31G W53G W82G mutant did not bind the consensus sequence by mobility shift assay 
(data not shown). Together with the data demonstrating that the "ANCA" core sequence was 
essential for hCdc5 binding, these results suggest that binding is specific for the helix-turn-helix 
motif. 

[00230] Our observation that a T->G substitution at position 4 or A->C substitution at 
position 12 abolished the DNA-protein interaction suggested that the symmetry of the flanking 
sequences was important. To determine whether the specific sequences or the symmetry 
provided by the palindrome was critical for binding, we tested additional mutants that created 
new palindromes flanking the core sequence, namely GATvfTAACATAT' (SEQ ID NO:49) and 
GATGTAACATAC (SEQ ID NO:50). Neither of these mutants bound Cdc5 in a mobility shift 
assay (data not shown). 

hCdc5 forms homodimers in vitro. 

[00231] To test the hypothesis that hCdc5 is capable of forming homodimers, and 
determine the regions of the protein that interact, we expressed and purified domains of hCdc5 in 
E. coli as His 6 fusions, and studied their ability to dimerize by gluteraldehyde cross-linking. We 
found that both the DNA binding domain (amino acids 1-120) and the putative nuclear 
localization domain (amino acids 205-358) (Bernstein, H. S., and Coughlin, S. R. (1997) Journal 
of Biological Chemistry 272(9), 5833-7) are capable of forming homodimers in vitro, while the 
central (amino acids 365-501) and carboxyl terminal (amino acids 491-802) domains do not 
multimerize under these conditions. In addition, we found that heterodimerization between 
domains of the protein did not occur. 

[00232] These results demonstrated that hCdc5 is capable of forming homodimers 
through its amino terminus. The increased affinity of hCdc5 for its consensus binding site 
conferred by flanking palindromes suggests that hCdc5 may preferentially bind to specific 
targets as a homodimer. Whether homodimerization is necessary for DNA binding in vivo, 
however, is not known. These findings also may explain why others have been unable to 
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identify specific binding sites for Cdc5 family members (Ohi, R., Feoktistova, A., McCann, S., 
Valentine, V., Look, A. T., Lipsick, J. S., and Gould, K. L. (1998) Molecular and Cellular 
Biology 18(7), 4097-108; Hirayama, T., and Shinozaki, K. (1996) Proc Natl Acad Sci USA 
93(23), 13371-6). We performed cyclic amplification and selection of targets with the first 500 
residues of the amino terminus of hCdc5, which included all of the sequence necessary for 
homodimerization. Others have reported using only the helix-turn-helix repeats for this purpose, 
which may not form the stable homodimers required for high-affinity binding to preferential 
DNA targets. 

Interaction of hCdc5 with target sequences occurs in vivo . 

[00233] While cyclic amplification and selection of targets can identify sequences 
capable of interacting with a DNA binding protein under cell-free conditions, we wanted to test 
whether the consensus sequence we had identified was able to select endogenous hCdc5 from 
cell extracts. A gel shift assay, performed with the consensus oligonucleotide and HeLa cell 
extract, demonstrated a single shifted band. To determine whether this band comprised 
endogenous hCdc5 complexed with the oligomer, we used antibodies specific for the amino and 
carboxyl termini of hCdc5 in an electrophoretic mobility supershift assay. Both antibodies were 
able to supershift the DNA-protein complex, confirming that the protein selected from HeLa cell 
extracts was hCdc5. This finding demonstrates the ability of the consensus oligonucleotide to 
bind hCdc5 in vivo. 

hCdc5 interacts preferentially with specific sequences in the human genome . 

[00234] To determine whether hCdcS binds specific sites in the human genome, we 
employed a selection system in yeast that previously was used to define the binding specificity of 
the nerve growth factor induced-B gene product (Wilson, T. E., Fahrner, T. J., Johnston, M., and 
Milbrandt, J. (1991) Science 252(5010), 1296-300). We constructed two plasmids to express in 
S. cerevisiae. The library or UAS (upstream activating sequence)-reporter plasmid consisted of 
the HIS3 gene placed downstream of an inactive GAL1 promoter missing the GAL1 UAS 
(Ording, E., Bergholtz, S., Brendeford, E. M., Jamin, N., and Gabrielsen, O. S. (1996) Oncogene 
13(5), 1043-51). We then inserted size-selected, human genomic DNA (1-4 kb) upstream of the 
inactive GAL1 promoter as a source of potential DNA binding sites. The activator plasmid 
encoded an epitope-tagged fusion of hCdc5's amino terminus with the activating domain of 
GAL4 (Brent, R., and Ptashne, M. (1985) Cell 43(3 Pt 2), 729-36) under control of a strong, 
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constitutive promoter (i.e., ADH1). Yeast cells expressing the hybrid transcriptional activator 
and carrying a library plasmid containing binding sites for hCdc5 expressed HIS3 and were able 
to grow in histidine-deficient media. 

[00235] To avoid the anticipated expression of HIS3 as a result of UAS recognition by 
an endogenous yeast transcriptional activator, we took advantage of the ADE5 and URA3 
markers on the activator plasmid. Yeast colonies bearing null mutations in the ade2 and ade5 
genes turn red in the presence of ADE5 (Liu et al., 1993, supra). Colonies that were able to grow 
in histidine-deficient media in the absence of the activator plasmid formed red colonies with 
white sectors, indicating loss of the activator plasmid. We discarded such "sectoring" colonies 
as false positives. In addition, yeast expressing the URA3 gene product die in the presence of 5- 
fluoro-orotic acid (5-FOA), which is metabolized to a toxic metabolite in the presence of the 
enzyme encoded by URA3 (Brent, R., and Ptashne, M. (1985) Cell 43(3 Pt 2), 729-36). Yeast 
transformants that required the presence of the activator plasmid for function of the library UAS 
grew in histidine-deficient media only in the absence of 5-FOA. We also discarded 
transformants that grew in histidine-deficient media in the presence of 5-FOA as false positives. 

[00236] Of ~5 x 1 0 5 transformants screened (representing approximately one-third of 
the human genome), 12 out of 218 non-sectoring colonies were sensitive to 5-FOA. These were 
sequenced and analyzed for homology with known sequences, as well as with each other. All 12 
plasmids contained novel human genomic sequence without homology to any sequences 
available in Genbank, including the Eukaryotic Promoter Database. A subset of these had short 
regions of limited homology (<500 bp) between them, and two in this subset appeared to be 
overlapping clones from the same genomic region with >3kb of overlap. The observation that 
selected clones had extended regions of homology suggested that hCdc5 interacts preferentially 
with certain sequences within the human genome. 

hCdc5's interaction with specific sites in the human genome is mediated by binding to its 

target sequence . 

[00237] To determine whether hCdc5 interacts with specific genomic sequences 
through direct DNA binding, we first surveyed the 12 genomic sequences (totaling -36,200 bp) 
identified by binding selection in yeast for the synthetic consensus binding sequence identified 
by cyclic amplification and selection in the example above. An exact match was not found, 

o 

although this was not surprising as the likelihood of finding a unique 1 2 bp sequence is 6 x 1 0" , 
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and is predicted to occur by chance ~180 times in the human genome. We then used the 
information obtained by scanning mutagenesis to look for sites containing the core "ANCA" 
motif flanked by palindromes within the genomic clones. Six such sites were identified in clones 
6, 8, 12 (2 sites), 25, and 30 (Table 2), of which 3 were able to interact with the amino terminus 
of hCdc5, as demonstrated by electrophoretic mobility gel shift assay (clones 6, 30 and one site 
in clone 12). Complementary oligonucleotides representing the sequences in Table 2 flanked by 
the non-random oligo flanking sequences used in the oligonucleotide selection experiments 
described above were used for the gel shift assays. Two of these active binding sites were found 
in the two clones which represented overlapping sequences from the same region of the genome 
(clones 6 and 30). All 3 sites shared features of the consensus binding sequence: a core 
"ANCA" motif, AT-rich flanking palindromes, and a distance between core and palindromes of 
no more than 3 bp. These experiments suggest that the consensus binding sequence for hCdc5 
can be extended to a core "AWCA" flanked within 3 bp by an AT-rich palindrome. This also is 
the first demonstration that a member of the Cdc5 family interacts with specific genomic 
sequences through direct DNA-protein binding. 



TABLE 2. Sequences of the Putative hCdc5 binding sites on Selected Clones 



Clone 


Protein Binding 


Sequence' 


Consensus 


+ 


GATTrAACATAA 
(SEQIDNO: 13) 


6 


+ 


AA IAAAATCAAAA TT 
(SEQIDNO: 15) 


30 


+ 


AA TAAAAT C AAAA TT 
(SEQIDNO: 15) 


12(1) 


+ 


AAAGGGGAACACTTT 
(SEQIDNO: 16) 


12(2) 




CCCCCCTTTAAACCAGCGTGGAGGGGGG 
(SEQ ID NO: 33) 


25 




AA TTCCCCGGATCATTGCAAACAATT 
(SEQ ID NO: 34) 
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8 




AATGAACGAATCAAATT 






(SEQ ID NO: 35) 



1 The palindromic sequences are shown in italics. The bolded bases are part of the core "ANCA" 



sequence. 

[00238] To test whether the sequence we abstracted from clone 6 was the one 
5 responsible for Cdc5/G,4Z4-dependent activation in yeast, we cloned subfragments of clone 6 
into the UAS-reporter plasmid, and tested their ability to confer activator-dependent growth in 
selective media (data not shown). A 500 bp subfragment containing the identified 15 bp 
sequence was necessary for Cdc5/G^4Z^-dependent activation in yeast, while removing the 1 5 bp 
!=j sequence from this larger fragment abolished activator-dependent growth. Smaller fragments 
;4o containing the 15 bp sequence failed to confer growth in selective media, suggesting that while 
fjl the 15 bp sequence is necessary for activation, other elements contained in the 500 bp clone 6 
;4 subfragment may be required for Cdc5/G^I4-dependent activation in yeast. 
=1= [00239] All references (including books, articles, papers, patents, and patent 

Si applications) cited herein are hereby expressly incorporated by reference in their entirety for all 
j~Jl 5 purposes. 

Q [00240] While the invention has been described in connection with specific 

VP. 

f% embodiments thereof, it will be understood that it is capable of further modification, and this 
application is intended to cover any variations, uses, or adaptations of the invention following, in 
general, the principles of the invention and including such departures from the present disclosure 
20 as come within known or customary practice in the art to which the invention pertains and as 
may be applied to the essential features hereinbefore set forth, and as fall within the scope of the 
invention and the limits of the appended claims. 
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